File APIs for Java Developers
Manipulate DOC, XLS, PPT, PDF and many others from your application.
http://aspose.com/file-tools
The moose likes Java in General and the fly likes Parsing Webpage and Links Big Moose Saloon
  Search | Java FAQ | Recent Topics | Flagged Topics | Hot Topics | Zero Replies
Register / Login


Win a copy of EJB 3 in Action this week in the EJB and other Java EE Technologies forum!
JavaRanch » Java Forums » Java » Java in General
Bookmark "Parsing Webpage and Links" Watch "Parsing Webpage and Links" New topic
Author

Parsing Webpage and Links

maverickml venkatesh
Greenhorn

Joined: Aug 17, 2011
Posts: 8
Hi,

Sorry if I had posted this in the wrong forum.

I have a webpage with links in a table. All i need to do is to navigate to each of these links and gather information from those pages.

As of now i can think of writing web crawler in java (again this has to be supported by the site)

Any other solutions like collecting the webpage response and HTML parsing the same.

Please suggest
Tim Moores
Rancher

Joined: Sep 21, 2011
Posts: 2408
The HttpUnit library (or jWebUnit, which builds on top of it) is perfect for that.
 
I agree. Here's the link: http://aspose.com/file-tools
 
subject: Parsing Webpage and Links
 
Similar Threads
Save File As...
Is your company/clients moving towards Web 2.0?
scanning the webpage
using string value as a variable name
insert multi language