Monday, 15 April 2013

How to Use HTTrack


HTTrack is a free and open-source web crawler and offline browser, developed by Xavier Roche and licensed under the GNU General Public License.
It downloads websites from the Internet to a local computer. By default, HTTrack arranges the downloaded site according to the original site's relative link structure. The downloaded (or "mirrored") website can then be browsed offline by opening one of its pages in a browser.

Steps

  1. Type httrack at a command prompt to start the interactive setup.
  2. Choose a project name (this will be the name of the folder containing the project). A single project can include multiple websites.
  3. Optionally enter a base path (by default, the project is stored in a websites directory in your home directory).
  4. Enter the URL(s) of the websites you want to mirror (separated by commas or spaces).
  5. Choose an action by typing its number.
    • Mirror website
    • Mirror website with wizard
    • Just get files indicated
    • Mirror all links in URLs
    • Test links in URLs
    • Quit
  6. Optionally enter a proxy.
  7. Optionally define wildcards.
  8. Optionally define additional options, such as the recursion level.
  9. Confirm you are ready to launch the mirror.
  10. HTTrack now begins mirroring. Be patient until it reports the mirror as completed.
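
The answers given in the steps above can also be passed to httrack as command-line arguments in a single, non-interactive run. The following is a minimal sketch in Python that only assembles and prints such a command; it does not start a download. The function name build_httrack_command, the example URL, the proxy address, and the filter pattern are all made up for illustration. The flags (-O for the output path, -P for the proxy, -rN for the recursion level, + patterns for wildcards, and the action flags) reflect my understanding of HTTrack's command-line help, so check them against httrack --help on your system before relying on them.

```python
import shlex
from pathlib import Path

# Assumed mapping from the menu actions in step 5 to httrack flags.
# Verify against `httrack --help`; this table is not from the original post.
ACTION_FLAGS = {
    "mirror": "-w",
    "mirror_wizard": "-W",
    "get_files": "-g",
    "mirror_links": "-Y",
    "test_links": "--testlinks",
}


def build_httrack_command(project, urls, base_path=None, action="mirror",
                          proxy=None, filters=(), depth=None):
    """Return the argument list for one non-interactive httrack run.

    Hypothetical helper: it mirrors the questions asked in steps 2 to 8.
    """
    if not urls:
        raise ValueError("at least one URL is required")
    # Step 3: fall back to a 'websites' directory in the home directory.
    base = Path(base_path) if base_path else Path.home() / "websites"
    # Steps 2, 4 and 5: project folder, URLs, and the chosen action.
    cmd = ["httrack", *urls, ACTION_FLAGS[action], "-O", str(base / project)]
    if proxy:                      # step 6: optional proxy as host:port
        cmd += ["-P", proxy]
    cmd += list(filters)           # step 7: optional wildcards such as "+*.png"
    if depth is not None:          # step 8: optional recursion level
        cmd.append(f"-r{depth}")
    return cmd


# Example values only; none of them come from the original post.
example = build_httrack_command(
    "docs",
    ["https://example.com/"],
    base_path="/tmp/mirrors",
    proxy="proxy.local:8080",
    filters=["+*.png"],
    depth=3,
)
# Print a shell-quoted command line that could be pasted into a terminal.
print(shlex.join(example))
```

Building the command as a list and quoting it with shlex.join keeps wildcard patterns such as +*.png from being expanded by the shell when the line is pasted into a terminal.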
