on technology, personal & funn
9 May
1 Apr
Like the regular website, google is also using robots.txt. Interestingly Google is using it to block search bots to index some of their content. Below is a small list from their robots.txt file. Looks like they are blocking those pages which will generate dynamic content. In other words they have tried to blog those pages which will return search results. Not sure how effective it is.
Looking @ this robots.txt file you will find out lot of interesting information. Ofcourse you need some patients for that…
User-agent: *
Allow: /searchhistory/
Disallow: /search
Disallow: /groups
Disallow: /images
Disallow: /catalogs
Disallow: /catalogues
Disallow: /news
Disallow: /nwshp
Allow: /news?btcid=
Disallow: /news?btcid=*&
Allow: /news?btaid=
Disallow: /news?btaid=*&
Disallow: /setnewsprefs?
Disallow: /index.html?
Disallow: /?
Disallow: /addurl/image?
Disallow: /pagead/
Disallow: /relpage/
29 Mar
Wallpaper Link : http://www.flickr.com/photos/25305804@N08/2379631425/
EEEPC Linux Wallpaper, originally uploaded by tortugoc.