« Проверка орфографии в Open Office 2.3 Message Box - как получить текст »
Posted on: Wednesday, December 26, 2007 at 7:21 pm
web surfing.
You can follow any responses to this entry through the RSS 2.0 feed.
You can leave a response, or trackback from your own site.
How to extract url’s from html
December
26th
Sometimes I need for download many(but not all) URL’s from the web page. I will describe in this post how it’s possible to do this.
For me it’s handy to use wget console utility for download everything. This download manager has option -i for download all urls from file.
So, for download many links we have to do following:
- In Firefox browser select interesting urls and perform “View Selection Sources”
- Copy html code that contain interesting links for download
- Create file, for example “links.txt” with that html.
- Launch
perl extractUrls.pl links.txt > li.txt
for extract links.
- Launch
wget -i li.txt
for download extracted links
extractUrls.pl:
I may remove some action for you by using clipboard or another techniques if you really interesting with that. Video tutorial will be made by your request also.
Regards,
Dmytro
Posted on: Wednesday, December 26, 2007 at 7:21 pm
web surfing. You can follow any responses to this entry through the RSS 2.0 feed.
You can leave a response, or trackback from your own site.
One Response to “How to extract url’s from html”
Leave a Reply
You must be logged in to post a comment.

January 12th, 2008 at 3:57 am
tnx for article, it really helped me a bit! :))