[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Omaha.pm] good mirror script?



Good to know, thanks.  What I with httrack turned out pretty good though.

-Ryan


Daniel Linder wrote:
Just a follow-up.

I ran this:
wget --convert-links -E -r -K -l 1
http://www.directwholesale.net/diagrams/diagrams.asp

The big change here from yours were the "-K"(keep original files) and "-l
1" (limit to one depth of recursion).

When this ran _AND_COMPLETED_, it printed the lines:
FINISHED --00:36:50--
Downloaded: 12,675 bytes in 3 files
Converting www.directwholesale.net/diagrams/diagrams.asp.html... 2-0
Converting
www.directwholesale.net/diagrams/diagrammodels.asp?link=.html... 0-2
Converting www.directwholesale.net/diagrams/DiagramMake.asp?link=.html...
0-100
Converted 3 files in 0.005 seconds.

Checking the differences between "diagrams.asp.html" and
"diagrams.asp.html.orig" shows that it has cleaned up some of the URLs to
make them local-filesystem friendly.

Looks like you have to wait for it to complete before it does the cleanup...

Dan

On Tue, April 10, 2007 23:53, Daniel Linder wrote:
On Mon, April 9, 2007 14:48, Ryan Stille wrote:
BUT - back to wget, here's what I'm doing, if you have any suggestions
I'm open to them.

wget --convert-links -E -r -H http://64.85.6.118/diagrams/diagrams.asp
When I ran the wget on my website I got the files with the URLs stripped
out.  I tried your exact example, but didn't wait for it to finish.  When
I ran wget pointed to a smaller website and used "-K" (to save the files
the --convert-links modifies) I can see it removing the "http://xxxx";
bits.

If you run the wget on a smaller site, does it work as you expect it to
there?

In my case, the wget had to completely traverse the entire site tree, then
at the end it reported multiple lines like "Converting
www.site.tld/file1.htm..."

Dan

- - - -
"Wait for that wisest of all counselors, time." -- Pericles
"I do not fear computers, I fear the lack of them." -- Isaac Asimov

_______________________________________________
Omaha-pm mailing list
Omaha-pm@pm.org
http://mail.pm.org/mailman/listinfo/omaha-pm



- - - -
"Wait for that wisest of all counselors, time." -- Pericles
"I do not fear computers, I fear the lack of them." -- Isaac Asimov

_______________________________________________
Omaha-pm mailing list
Omaha-pm@pm.org
http://mail.pm.org/mailman/listinfo/omaha-pm