在linux下完整的用wget命令整站采集网站做镜像
的命令是:

wget -m -e robots=off -U “Mozilla/5.0 (Windows; U; Windows NT 5.1; zh-CN; rv:1.9.1.6) Gecko/20091201 Firefox/3.5.6” “http://www.example.com/”

wget命令
参数注释:

“-e robots=off”  让wget耍流氓无视robots.txt协议

-U “Mozilla/5.0 (Windows; U; Windows NT 5.1; zh-CN; rv:1.9.1.6) Gecko/20091201 Firefox/3.5.6”  伪造agent信息

Leave a Reply

Your email address will not be published. Required fields are marked *