Hello all!
I need some help. I have been given the task of downloading files (mostly PDF and Excel files) from a website. Here is how the website works:
(1) The website is password protected.
(2) The website has 2000 "sections", and each section contains a PDF file that I am trying to download.
(3) The main page has links to all 2000 sections. I can click each link, go to that section, and download the PDF file, one by one.
(4) The PDF files all share the same web address apart from a number that differs for each file, indicated by (*) in the following link:
https://village.trialwebsite.com/doc/(*)
This is a sample link for illustration purposes only.
Is there a faster way to do this? I tried using FlashGet and FlashGot, but when I started downloading, all I got were the pictures and some strange files with unfamiliar extensions... basically everything but the PDF files!
Thanks everyone for your help
Downloading from different pages of the same website
Re: Downloading from different pages of the same website
Depending on the layout of the website (how the paths and files are named), FlashGot's Build Gallery feature may help.
If there is no workable pattern to the naming, then you might look at a program like HTTrack Website Copier.
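If the names follow no pattern, another route is to read the links off the main page and keep only the ones that point at documents. Below is a rough Python sketch of that idea, not a tested recipe for this site. It assumes the links end in .pdf, .xls, or .xlsx, which may not hold here, and the names DocLinkParser and document_links are made up for illustration. Fetching the page itself while logged in is not shown.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin

# Assumption: document links can be recognised by their file extension.
WANTED = (".pdf", ".xls", ".xlsx")


class DocLinkParser(HTMLParser):
    """Collects absolute URLs of <a href> links that look like documents."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return  # skip images, scripts, and everything else
        href = dict(attrs).get("href")
        # Ignore any query string when checking the extension.
        if href and href.lower().split("?")[0].endswith(WANTED):
            self.links.append(urljoin(self.base_url, href))


def document_links(html, base_url):
    """Return the document URLs found in an HTML page, in page order."""
    parser = DocLinkParser(base_url)
    parser.feed(html)
    return parser.links
```

The resulting list could then be saved to a text file and handed to a download manager, so that pictures and other page files are never queued.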
For Build Gallery, the URL patterns might look something like this:
https://village.trialwebsite.com/doc/section[1-2000;1]/somesequenceofnumbers[1-3;1].pdf
https://village.trialwebsite.com/doc/section[1-2000;1]/somesequenceofnumbers[1-3;1].xls
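To see what such a pattern expands to outside the browser, here is a small Python sketch. It assumes each bracket means [start-end;step], which is how I read the patterns above; the function name expand is just for illustration. It only builds the list of URLs and does not log in or download anything.

```python
import itertools
import re

# Matches one bracketed range such as [1-2000;1], read as start-end;step.
RANGE = re.compile(r"\[(\d+)-(\d+);(\d+)\]")


def expand(pattern):
    """Yield every URL produced by the [start-end;step] ranges in pattern."""
    # With three capture groups, split() returns:
    # literal, start, end, step, literal, start, end, step, ..., literal
    parts = RANGE.split(pattern)
    literals = parts[0::4]
    ranges = [
        range(int(start), int(end) + 1, int(step))  # end is inclusive
        for start, end, step in zip(parts[1::4], parts[2::4], parts[3::4])
    ]
    # Combine every value of every range, leftmost range varying slowest.
    for combo in itertools.product(*ranges):
        url = literals[0]
        for number, literal in zip(combo, literals[1:]):
            url += str(number) + literal
        yield url
```

Each pattern above has ranges of 2000 and 3 values, so it would expand to 6000 candidate URLs. Many of them may not exist on the real site, so it is worth trying a small range first.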