Wi-Fizzle.com - Putting the fizzle in Wi-Fi since 2005 .. (yes, this was a poor choice for a domain name)

<div class="news_item">#57<div class="news_title">HOWTO: Coax wget into ignoring robots.txt

Posted by dandriff on Saturday November 18, 2006@02:39PM

Don't just you love when options are left off of the --help/-h/-help etc.. of a program? Well boys and girls, add wget to your list of programs with undocumented useful features (probably not in the help because they're worried about people abusing the feature I guess).

So, to get make wget ignore robots.txt rules, use:

 wget -erobots=off http://your.site.here

This information/option is listed on the gnu.org wget FAQ