2005-02-13

Google Image Bot Bears Base Bugs?

For the first time since this site was known as the Nanobox, Google's image bot is crawling my website. But unfortunately, all it's finding is 404s. It appears as though it isn't handling the base tag properly. For example, the regular Googlebot looked at the URL http://nanobox.chipx86.com/public/Gallery/R/Ranma%20%C2%BD/Group/, which has a base tag with the href attribute set to http://nanobox.chipx86.com/. On that page, it saw a link with an href attribute set to public/Gallery/R/Ranma%20%C2%BD/Group/ranma33.jpg. Using the specified base, it should have directed the Googlebot to http://nanobox.chipx86.com/public/Gallery/R/Ranma%20%C2%BD/Group/ranma33.jpg, but instead Googlebot-Image went to http://nanobox.chipx86.com/public/Gallery/R/Ranma%20%C2%BD/Group/public/Gallery/R/Ranma%20%C2%BD/Group/ranma33.jpg and got a 404. It clearly ignored the specified base and just used the default one. So now Google thinks that I have a website full of broken images. Looks like it may be quite a while until I get my images indexed by them.

I've made modifications to viewer spider in Downy WebFront to circumvent this problem.

2 comments

Anonymous

Do you want Google to index your pages/images?

~Naveronski

Nanobot

Yes, which is why I've taken care to make my site spider-friendly. I didn't anticipate this bug when I made viewer spider in Downy.

Post new comment

Comment moderation policy: Your comment will be reviewed before it is added to the site. This is in response to spam and other forms of abuse. I gladly accept comments containing criticism as long as the language is clean.

This weblog is powered by Blogger.