View Full Version : Questions on web ripping & PDFing


Dave the Rave
April 13th, 2004, 06:23 PM
I am planning some new files, on mine making and anti-armor weapons, but I have some questions... Which program is suitable for web ripping? Much of the information I've found is on websites, and I need to download all of their contents to ease my work.

And is it possible to edit an already made PDF? How can I do it? Any suggestions for programs that allow me to do it?

Any suggestion is welcome...

Jacks Complete
April 13th, 2004, 06:39 PM
I've got a program called "WebReaper", from http://www.webreaper.net/ which really does the job. It handles JavaScript and most things, but you can't tell it to override the robots.txt file, which is a shame. I've had it wander off and download for an entire night, and it works well. The bonus is that each Top Level Domain goes into a different folder, so you can scrap the obvious junk that your filter config missed, and clearly see lots of other interesting sites that you might otherwise have missed the link to.
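For anyone curious how the one-folder-per-site layout can work, here is a minimal Python sketch. It is not WebReaper's actual code: the function name `local_path`, the `mirror` root folder, and the choice to group by full hostname (rather than strictly by top-level domain) are all assumptions made for illustration.

```python
from pathlib import PurePosixPath
from urllib.parse import urlsplit


def local_path(url, root="mirror"):
    """Map a URL to a save path of the form root/<hostname>/<path>.

    Hypothetical helper: the name, the default root, and the
    "index.html" fallback are illustrative assumptions.
    """
    parts = urlsplit(url)
    host = parts.hostname or "unknown-host"
    # Drop empty, "." and ".." segments so a path cannot climb out of root.
    segments = [s for s in parts.path.split("/") if s not in ("", ".", "..")]
    # Directory-style URLs get a default file name.
    if not segments or parts.path.endswith("/"):
        segments.append("index.html")
    # The query string is ignored in this sketch.
    return str(PurePosixPath(root, host, *segments))


print(local_path("http://www.webreaper.net/"))
print(local_path("http://www.metaproducts.com/mp/mpProducts_Detail.asp?id=17"))
```

Because every page from one host lands under one folder, deleting the junk from an unwanted site is a single folder delete.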

zaibatsu
April 13th, 2004, 07:17 PM
Please note, this conversation will not be allowed to move onto the topic of cracking PDF files - removing security and such.

When I need to get a webpage, I just use webcapture on Acrobat.

MightyQuinnŽ
April 14th, 2004, 03:22 AM
I have a program that will open *most* pdf files in Microsoft Word. Keeps all the formatting and allows editing. You may save the file in a .doc format from there.

It will not work on protected .pdf's, but does a good job for the most part.

Also....OmniPage Pro will do the same job, but better.

steyr
April 14th, 2004, 11:20 AM
Does anyone here have a link to Adobe Web Capture?

I want to rip some websites (E&W related, of course) and put them on the FTP.

aikon
April 14th, 2004, 11:29 AM
For downloading web pages I use Offline Explorer from MetaProducts.
Just visit their homepage and try it for 30 days free.
http://www.metaproducts.com/

Dave the Rave
April 14th, 2004, 01:01 PM
Of course we won't talk about cracking PDFs, that would be illegal!!!

My idea is just to have the ability to edit and translate the language of the information inside the file. Right now, I'm working on a translation of the Abwehr, from German to English, but I want to keep the original images and formatting...

MightyQuinn, what's the website and the name of the program you use to edit? And where can I find OmniPage Pro?

JC, thanks for WebReaper, I'll try it. If I have any questions, may I ask you?

I don't know anything about Web Capture in Adobe. Is it a plugin, or is it already in the full program? Can you tell me more, please, Zaibatsu?

And by the way, what happened to your nick? Where is your capitalized "Z"?

-------------

I've now found "Solid Converter Pro" to edit PDF files, I'll try it now.

zaibatsu
April 14th, 2004, 02:31 PM
Sorry, I should have explained a little more. All this information will relate to the version of Adobe Acrobat I use, v6 Professional.

On the menus, go to File, Create PDF, From Web Page and then obviously select what you want. To get most pages it's rather simple to just select get entire site, which seems to work fine for me. I used this to grab a website featuring single-shot rifle patents, and it worked very nicely.

To get Acrobat 6, the easiest way would be to download the 30 day trial from Adobe's website, and then find a crack for it, such as one of the cracks which introduce a new exe launcher file.

Dave:

Zaibatsu, zaibatsu, it's all the same to me.

Dave the Rave
April 14th, 2004, 03:50 PM
Nope, "Solid Converter Pro" sucks...

It can't convert images, just plain text, so the main purpose, which is to change the text but keep the images, is lost...

I'll search more for a converter that can handle images... I'm still taking advice & suggestions.

-----------------------------

Another topic inside the topic: how about "uncooking" PDF files? Those files that contain items that Adobe can't handle, and which therefore turn the whole file into crap? Is there any program that can correct it?

Jacks Complete
April 14th, 2004, 07:18 PM
Dave,
if you want to ask questions go ahead. I'm no expert with Webreaper - I tend to use Firefox (the browser) to rip any pages now, it is only full sites that I want to copy that I use webreaper for, and then on a fast connection!

MightyQuinnŽ
April 14th, 2004, 07:31 PM
.... I want to keep the original images and formatting...

MightyQuinn, what's the website and the name of the program you use to edit? And where can I find OmniPage Pro?

I have ScanSoft PDF Converter for Word. It keeps all images and formatting and such. Email me and I will get it to you.

**Edit.....I will add the link for the PDF software below.

http://www.50caliber.net/pdfs/pdfc.zip

I am still in search of OmniPage Pro 14 made by the same company. ;)

jelly
April 14th, 2004, 08:46 PM
aikon is right... I highly recommend the "Offline Explorer Enterprise"!!!

I have tried out many website rippers... this $400 program is the best of all.

And it's the only ripper that can record streaming audio/video to your hard disk
(e.g. Real Media or Windows Media Player files via RTSP:// or MMS://).

http://www.metaproducts.com/mp/mpProducts_Detail.asp?id=17

You'll find it at metaproducts.com... or on the emule network ;)

a_bab
April 15th, 2004, 06:15 AM
I use Teleport Pro. Very easy to use, very fast.

ossassin
February 6th, 2005, 01:38 AM
What's wrong with hitting "File" and "Save As," and saving it as a web archive (.mht) file? That's what I do. Any version of IE should be able to open it.

skier4life99
February 23rd, 2005, 04:42 PM
If you are looking to export any type of office document to .pdf, Open Office works great; it has an "Export to PDF..." option under the File menu. So if you can get your data into a format viewable in MS Office, you can use Open Office to convert it. The best part is that it's free, so no illegal cracks or anything to weigh on your mind.

http://www.openoffice.org/

Silentnite
February 27th, 2005, 03:14 AM
This may happen if you install either the latest Adobe or the latest Adobe Photoshop CS; on my computer I did both. I run OpenOffice, and somehow my main "PRINT" button goes directly to a PDF. It "prints" the file to a PDF.

If any of the more adept Adobe users would like to further explain this phenomenon I'd greatly appreciate it. Beyond that, I hope you can duplicate it.

P.S. Look in the RogueScience gmail account for Offline Explorer.

MightyQuinnŽ
February 27th, 2005, 10:28 PM
I run OpenOffice, and somehow my main "PRINT" button goes directly to a pdf. It "prints" the file to a pdf.

If any of the more adept Adobe users would like to further explain this phenomenon I'd greatly appreciate it.

If you open your printers folder, you may notice that the PDF printer is set as the default printer. Set your normal printer as the default (right click and choose 'Set as default') and the problem should correct itself.

Then you can always choose to print to a .pdf by selecting 'file' then 'print' to choose the printer you want to print to.

dguy
March 8th, 2005, 10:26 AM
Magellan Server (http://www.bcltechnologies.com/document/products/magellan/magellan_workflow.htm) Lite can convert PDFs with graphics to HTML. You can enter a false email straight away at the above URL and download directly. Things to beware of: you must copy a folder's contents, and it will encompass the folder you choose to copy to; it tags the HTML (easy to remove); and finally, it deletes the source folder's contents!

dguy
March 8th, 2005, 11:05 AM
Magellan Server (http://www.bcltechnologies.com/document/products/magellan/magellan_workflow.htm) Lite can convert PDFs with images to HTML. But there are some things to be aware of.

Jacks Complete
April 5th, 2005, 08:10 PM
There is also a suite of PDF tools from a company called PDF995, which are quite powerful, free (advertware, or you can pay to unlock for faster operation and no adverts) but some of the features aren't insanely user friendly.

It lets you do things like break a PDF into individual pages, add a stamp, export pages as graphics, rip out the text, and lots more. It also has a fake PDF printer, so you can print anything to a Windows printer that is really a PDF file on disk.

http://www.pdf995.com

Silentnite
April 5th, 2005, 08:49 PM
http://mscracks.com/cracks/P5.php Almost all the way down the page, there are some Keygens for PDF995. That should help with the adware and such. Just make sure to run a virus check!
