Copy Entire Website

sally_91

Distinguished
Dec 22, 2012
79
0
18,630
Hi,

I'm trying to copy a website that's on web.archive.org, as you can no longer access it through any other means.


I've used programs like WebCopy and HTTrack with no luck.

I use Firefox as my web browser and I'm only able to copy a single page at a time.

I thought I could copy all the pages separately and compile them into one .htm or .html file but I'm not sure how.
 
Solution


1. The current representation at archive.org is NOT the original

2. A little bit of HTML, and you can make your own. Properly constructed, you can put it on a CD or USB, and have it work just as it is on archive.org.


If you are doing this without the makers consent you are basically commiting theft.
 
Doesnt exactly work like that.

First and foremost if the website has anything besides html and css ( so javascript, asp, ruby, perl, etc) you cant copy that code so you will have a non-functioning website. If the website has a search function, a login, a database of items you browe (like forums or shopping) or even a multi-picture banner/slideshow then it has other programming languages than just html/css.

Secondly, you have to know and copy the folder structure of the original site in order to make even html usable.
Usually all the html pages are in the root folder and images in a separate folder, but if the designer organized it beyond that then you will have look at every single link to figure out the structure.
 

sally_91

Distinguished
Dec 22, 2012
79
0
18,630


Jeez, sounds complicated.

The website I'm trying to copy is this one
It has 12-13 pages at the most and is very basic.

It's already on archive.org and I'm afraid that it won't always be there.
 

USAFRet

Titan
Moderator


So go through each page, copy ALL the text, paste into a Word doc or other.
Your own personal copy.
 

sally_91

Distinguished
Dec 22, 2012
79
0
18,630


True, I just wish I could keep the elegance and functionality of the original website design.

I was going to share it with this person and presentation would have been key.
 

USAFRet

Titan
Moderator


1. The current representation at archive.org is NOT the original

2. A little bit of HTML, and you can make your own. Properly constructed, you can put it on a CD or USB, and have it work just as it is on archive.org.
 
Solution

Wolfshadw

Titan
Moderator
[strike]How big a can of worms are you looking to open?

You can copy each page into a word processor program like MS Word or LibreOffice Writer and save the file as an HTML document.
If you're running Windows, you can also (likely) turn on your own Internet Information Server/Personal Web Server that makes it available, in web form, anywhere on your network.
However, to make it accessible from anywhere in the world, it get's real messy from there.
[/strike]

Scratch that. USAFRet is being all smart again! :/
-Wolf sends