Re: wget - the question for advanced users
- From: noi <noi@xxxxxxxx>
- Date: Thu, 28 Dec 2006 08:45:15 GMT
On Wed, 27 Dec 2006 14:32:59 +0100, Piotr wrote this:
Hi!
I'm trying to make a local copy of my homepage with photos:
www.pbase.com/piotrstankiewicz
More precisely I try to make a copy of this site with all child pages
starting with www.pbase.com/piotrstankiewicz (which is not a problem) and
additionnaly I want to download all the photos used. Unfortunately here I
have a problem as the photos are placed on other servers from the domain
pbase.com.
I tried something like that:
wget -r -H -p --convert-links --no-parent -l 3 --html-extension
-Dpbase.com --exclude-domains forum.pbase.com,search.pbase.com
http://www.pbase.com/piotrstankiewicz
Unfortunately it doesn't work.
I don't know why wget tries to download directories above
www.pbase.com/piotrstankiewicz and it starts downloading ex.
www.pbase.com/register.html (it looks like the option --no-parent doesn't
work or I don't fully understand it's behavior). When is starts to
download the documents from the main forder, it continues and continues
and it wants do download the contents of all the server. :(
How to force the situation that all the pages which doesn't start with
www.pbase.com/piotrstankiewicz (with the exception of photos placed on
other servers) are ignored?
I tried to use the option -I /piotrstankiewicz,/piotrstankiewicz/image In
such a situation all the web pages are downloaded ok (in the way I expect)
but wget doesn't download any photos (ex. it ignores that photo
http://i5.pbase.com/o4/43/588543/1/60342441.SA0469_20na30_final.jpg )
I tried also to use the option --exclude-directories
/galleries,/help,/login (without -I option) as there are the links from
the www.pbase.com/piotrstankiewicz site directing to the structure above
but wget it ignores and it starts to download the contents of all the
site.
Any ideas?
---------------------------------------------------
http://www.pbase.com/piotrstankiewicz
Did you try the --mirror option?
Start over use the --mirror option for your homepage. Enable log file
then maybe you can see what Wget is doing.
.
- Follow-Ups:
- Re: wget - the question for advanced users
- From: Piotr
- Re: wget - the question for advanced users
- References:
- wget - the question for advanced users
- From: Piotr
- wget - the question for advanced users
- Prev by Date: Re: Mandriva 2007 install will only run in 640x480 mode. Have beat my head against this for 15+ hours...
- Next by Date: Matrox G550 PCI DualView and xorg.conf
- Previous by thread: wget - the question for advanced users
- Next by thread: Re: wget - the question for advanced users
- Index(es):
Relevant Pages
|