I was contacted today by a newspaper reporter from Charlotte, North Carolina, to comment on the death of a local blogger, one of a pair of women who have taken Charlotte by storm with their social commentary blog. I wanted to research this myself to write about it here, so I headed to Google, the search engine of choice, entered death, social, bloggers, charlotte, north carolina, and clicked over to Blog Search when Web and News came up empty. I expected to get a few hits, as the reporter said the death of this young woman was the “talk of the town” and the community was turning out to support the surviving blogger.
What I got were ten search results all from Google Blogger/Blogspot sites.
My first reaction? Google must now give priority to their own bloggers in the search results. It’s a good assumption based upon the evidence.
Ah, but wait. That’s not the way to really judge the search results fairly. After all, I don’t just make snap assumptions. I evaluate all the information at hand. Maybe these Blogspot blogs have the information I’m seeking.
Two of the ten were legitimate blogs, though not with the answers I was seeking. The rest were splogs. Here is an example of the text from the first two on the search results:
union funeral home in whiteville north carolina
31 Jan 2008 by hurjmmrhol
perestroika acupuncture insomnia night at the museum Three days grace just like you Home interiors swinging married couples santas reindeer clip art coal insert for zc pre fab oakhurst in charlotte, nc natural to a tee vermont regulate …
All the countries have joined together… – http://zxxxxx.blogspot.com/ – References
social services in tonasket washington
22 Jan 2008 by vohoypkrgl
spoke rims dmv charlotte nc the color purple metaphors americanbaptist map of pittsburgh areas www srbaseball com thedallymail bare essentals las vegas albert zavaro. Wamsutta serendipity active care physical therapy Hand held golf …
Comprehensive news articles from the magazine…. – http://xxxxxx.blogspot.com/
Ah, you recognize them. Splogs. Those spam blogs that generate crap content and stuff themselves with links. Having been to Whiteville, North Carolina, and having grown up near Tonasket, Washington, I’m embarrassed for their communities for having such ugly online ties.
Again, let’s not jump to conclusions. Let’s go to the next set of ten search results.
Oh, my. Ten out of ten splogs all on Blogspot. And not only that, Tonasket and some of the other splogs from Blogspot make a repeat appearance in the list. Maybe page three? Two out of ten are legit.
I kept on going, now intrigued at all the Blogspot blog posts popping up. How far would I have to go to get to a good ratio of good content to Blogspot splogs?
On page five, fifty search results in, I finally found my first non-Blogspot blog – but it was also a splog. On page 10, I found three legits to seven splogs, but page 11 was back to two out of ten. By page 16, I gave up. With about 32 legitimate blog posts out of 160 search results, things were not looking up.
What brought on this flood of splog search results?
My search keywords were: death, social, bloggers, charlotte, north carolina. North Carolina was in every one of the search results, closely followed by death, social, and occasionally blog or blogger. “Social security” was also found in combination with social and death. Maybe I hit the jackpot in splog search terms? But this is not a one-time incident. It happens daily for me. It’s also not the real point that needs to be addressed.
It seems that about 39,000 fake blogs were created from among the 805,000 new blogs started on Blogspot over the past two weeks and FlightSplog, monitoring new blogs at Blogspot, “documented 2,763 porn splogs from a single splogger”.
Netcraft’s article reported that IceRocket was going to stop indexing Blogspot until they cleaned up their act, and other similar services were also crying foul at the overwhelming numbers of splogs on Blogspot which flooded their databases and plagued their users.
It’s three years later. I expect those numbers are a minuscule drop in the splog bucket now.
By the way, I still haven’t found the information I’m seeking about the bloggers in North Carolina. I’ll keep digging, but I shouldn’t have to.
Google, Are You Listening?
Thank you for the link love for these past few years. I adore how you have kept your cool under such tremendous pressure and the onslaught of millions of new blogs and blog posts every day, combined with us whining bloggers hacking and whacking at your good intentions. However, Blogger/Blogspot is becoming a nuisance and blight on the web, and I’d like to address this issue with you.
The overwhelming number of spam blogs hosted on your free blog hosting service interferes with our ability to find the information we need on your search engine. Many of them scrape our blog content: the words, pictures, sound, and video we worked hard to create for you to index.
I know you are working hard to fix the broken PageRank and trying to build up the TrustRank and profiling system so searchers will find only the quality content they need, and site owners will get the score they deserve, not one they can game. I know there are supposed to be filters and protections in place to stop spam blogs, but please, let us help you make it easier while you are improving your algorithms against the bad guys.
All I ask is that you make it easier for us to help you clean up the web.
Let us tell you when we’ve spotted a splog. Force the Blogger bar back onto all Blogger/Blogspot blogs. Put back the “flag this blog” warning members can use to identify a splog or copyright violator.
Make it easier to submit DMCA violators and splogs through a one stop online visit, not a trip to the post office. Make us learn quickly that reporting copyright violators will get something done about it.
Create a more viable scorecard that tracks which Blogspot/Blogger blogs are getting the most complaints and shut the offenders down faster.
Add an “Alert” checkbox so we can tag search results as splogs as we stumble across them along our search journey through Google. Pass the tagged results through a filtering algorithm that tests for coherent English grammar. If a result is coherent, take it off the flagged list but record in the database that it has been identified once; if it is identified 10 or 20 more times, kill it. If it is not coherent, shut it down.
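For the curious, here is a rough sketch in Python of how that “Alert” checkbox idea might work behind the scenes. Everything in it is my own assumption, not anything Google has said it does: the `SplogTracker` class, the `looks_coherent` check (a crude count of common English function words standing in for a real grammar test), the 0.3 cutoff, and the choice of 10 repeat flags from the “10 or 20” I suggested.

```python
import re
from collections import Counter

# Assumption: "10 or 20 more times" in the proposal; 10 is picked arbitrarily.
REPEAT_FLAG_LIMIT = 10

# Assumption: a crude stand-in for a real grammar check. Keyword-stuffed
# splog text tends to contain very few ordinary function words.
FUNCTION_WORDS = {
    "the", "a", "an", "and", "of", "to", "in", "on", "at", "by", "for",
    "with", "is", "was", "are", "it", "that", "this", "i", "you", "we",
    "but", "not",
}
MIN_FUNCTION_WORD_SHARE = 0.3  # hypothetical cutoff


def looks_coherent(text):
    """Return True if enough of the words are common function words."""
    words = re.findall(r"[a-z']+", text.lower())
    if not words:
        return False
    share = sum(w in FUNCTION_WORDS for w in words) / len(words)
    return share >= MIN_FUNCTION_WORD_SHARE


class SplogTracker:
    """Hypothetical record of reader flags per blog URL."""

    def __init__(self):
        self.flag_counts = Counter()
        self.shut_down = set()

    def flag(self, url, text):
        """Record one reader flag and return 'shut_down' or 'kept'."""
        if url in self.shut_down:
            return "shut_down"
        self.flag_counts[url] += 1
        # Flags received after the first identification.
        repeat_flags = self.flag_counts[url] - 1
        if not looks_coherent(text) or repeat_flags >= REPEAT_FLAG_LIMIT:
            self.shut_down.add(url)
            return "shut_down"
        # Coherent and not flagged often enough yet: keep the blog,
        # but remember that it has been identified.
        return "kept"
```

A blog with readable text survives an occasional flag, while gibberish is shut down on the first flag. The repeat limit is there because a coherent-looking blog can still be a scraper, and a single malicious flag should not take down a legitimate one.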
There has to be a nice way of doing this. Sure, there is always room for abuse, but let us help you. The good white hat wearing web users represent the majority and we are tired of this. We want Google cleaned up. We think starting with cleaning up Blogspot/Blogger is a good place to begin.
We, the bloggers of the world, really like you, Google. We put your ads, search, maps, news, and gadgets on our blogs. We write our post content to meet your needs so you will like us. We build our web designs not just with web standards but with Google standards in mind. Our lust for all things Google puts billions in your pockets. We live and breathe through Google, so let us help you help us.
Thank you for listening.