Gobinath

on technology, personal & funn

Twitter Weekly Updates for 2009-05-09

  • 0 Comments
  • Filed under: Misc
robots.txt

    Like any regular website, Google also uses a robots.txt file. Interestingly, Google uses it to stop search bots from indexing some of its own content. Below is a small excerpt from that file. It looks like they block the pages that generate dynamic content; in other words, they have tried to block the pages that return search results. I am not sure how effective this is.

    Looking at this robots.txt file, you will find a lot of interesting information. Of course, you need some patience for that… :)

    User-agent: *
    Allow: /searchhistory/
    Disallow: /search
    Disallow: /groups
    Disallow: /images
    Disallow: /catalogs
    Disallow: /catalogues
    Disallow: /news
    Disallow: /nwshp
    Allow: /news?btcid=
    Disallow: /news?btcid=*&
    Allow: /news?btaid=
    Disallow: /news?btaid=*&
    Disallow: /setnewsprefs?
    Disallow: /index.html?
    Disallow: /?
    Disallow: /addurl/image?
    Disallow: /pagead/
    Disallow: /relpage/
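
    To see how rules like these play out, here is a minimal sketch using Python's standard urllib.robotparser on a few lines copied from the listing. The helper name is_allowed and the bot name ExampleBot are made up for illustration. Python's parser takes the first matching rule in file order and does not understand the * wildcard inside paths, so it may not agree with Google's own crawler on the /news lines; treat it as a rough check only.

```python
from urllib.robotparser import RobotFileParser

# A few rules copied from the excerpt above (not the full file).
RULES = """\
User-agent: *
Allow: /searchhistory/
Disallow: /search
Disallow: /groups
Disallow: /images
"""


def is_allowed(path, user_agent="ExampleBot"):
    """Return True if the rules above let user_agent fetch path.

    ExampleBot is a hypothetical crawler name; any name falls under
    the "User-agent: *" group here.
    """
    parser = RobotFileParser()
    parser.parse(RULES.splitlines())  # parse from memory, no network access
    return parser.can_fetch(user_agent, path)


if __name__ == "__main__":
    for path in ("/searchhistory/", "/search?q=robots", "/images", "/intl/en/about.html"):
        print(path, "->", "allowed" if is_allowed(path) else "blocked")
```

    Because "Allow: /searchhistory/" comes before "Disallow: /search", the search history page stays reachable even though every other path starting with /search is blocked. Paths that match no rule at all are allowed by default.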

    Wallpaper - Mar 29

    Wallpaper Link : http://www.flickr.com/photos/25305804@N08/2379631425/

    EEEPC Linux Wallpaper, originally uploaded by tortugoc.

  • 0 Comments
  • Filed under: Wallpaper