18 January 2007
Google Blogsearch has serious problems with Blogger blogs
At the moment I am really busy doing my homework for the state of the Dutch Biblioblogosphere. One of the measures I wanted to use is the number of posts written last year. Google Blogsearch is the ideal candidate, since it supports date limits and has a blog version of the site: command, namely blogurl:. Checking two very regular blogs, however, I got results I can't trust at all. Edwin is a daily blogger who posts at least twice a day, perhaps three times. But the Google Blogsearch for the number of posts on his blogspot gave only 221 results, and that's without date limits. I expected at least double that number. The other blog I checked, also a Blogger-hosted blog, by Margreet van den Berg, yielded only 89 results. She has been blogging for years already, one post a day, so there should have been heaps more. Checking my own blog, managed by Blogger but hosted on my own site, gave only 385 results. That is slightly better than the other two examples, but a few days ago I celebrated my 500th post.
The real pity is that Ask Blogsearch has no proper site:-like command, and neither does Icerocket. With Technorati you can't easily limit by date range. And Feedster has too small a database to be meaningful for Dutch blogs.
So I got stuck.
I can only hypothesize that the indexing of Blogger posts in Google Blogsearch was somehow affected by the changeover to the New Blogger.
Comments:
It's strange indeed, Wouter. But it gets even stranger when you use the Google search function on my blog. When you type in 'Zeeuwse' you get about 500 posts, 'edwin' gives 557 hits, and so on. That is Google Web, though. And they did not index well in the first month.
You get bad results with 'inurl:zbdigitaal', but try that one in Ask Blogsearch and you will get more than a thousand.
Out of a total of 725 posts. Odd.
That is why I started backing up in WordPress and saving the links in Delicious. Since I started pinging both Google services, the links tend to show up more often. That pays off, I presume...
According to Feeds4all.com:
1) Digitaal Inlichtingenwerk Zeeuwse Bibliotheek: 593 posts [since 2006/10/04]
(http://www.feeds4all.com/Feed.aspx?FeedID=53955&Tag=digitaal%20inlichtingenwerk)
2) Margreet van den Berg - ICT en onderwijs: 125 posts [since 2005/05/10]
(http://www.feeds4all.com/Feed.aspx?FeedID=41408&Tag=Margreet%20van%20den%20Berg%20-%20ICT%20en%20onderwijs)
3) WoW! Wouter over het Web: 239 posts [since 2006/01/03]
(http://www.feeds4all.com/Feed.aspx?FeedID=50168&Tag=WoW!%20Wouter%20over%20het%20Web)
I'm an engineer on the Google BlogSearch team. I took a look and there are a couple of problems that we'll investigate further. There is no problem related to new Blogger, though. I checked our backend servers and we do have all your posts.
I'll let you know when we get these issues resolved.
@Jeremy,
Thanks for your reply. I am really interested to hear from you when Blogsearch has resolved the observed problems.
Blogsearch also returns strange results for my blog. It gets my name wrong: it says my surname is "Jnes" instead of "Jones". It is leaving a letter out, even though the feed and all possible settings are correct. If it is leaving a letter out of my name, is it also leaving important letters out of the posts themselves, with "can't" becoming "can", for instance? There is clearly something wrong somewhere, but I am unable to put my finger on it. However, the problem is within Blogsearch, as Google itself cannot find "Jnes" on any of my pages, and neither can any other search engine. So the word "Jnes" does not exist on any page of my site, yet Blogsearch says it does. Strange.


