Does anyone have experience with htmlparser that can help me out?

 
Jacky Luk
Ranch Hand
Posts: 634

I'm getting this exception:

java.io.IOException: Invalid Http response

The link I'm fetching is dynamic and contains tokens. How do I parse a web site like that using htmlparser?

http://htmlparser.sourceforge.net/

Thanks in advance
Jack
 
Ulf Dittmer
Rancher
Posts: 42966
That library hasn't been maintained for many years. If this was my problem I'd switch to a current library like HtmlUnit.
 
Jacky Luk
Ranch Hand
Posts: 634
Ulf Dittmer wrote: That library hasn't been maintained for many years. If this was my problem I'd switch to a current library like HtmlUnit.


Thanks Ulf, I'll take a look at that one.
Jack
 
Jacky Luk
Ranch Hand
Posts: 634
I am wondering how to capture a Thumbelina (the image that a little thumbnail links to) from Google with HtmlUnit.

How do I get the URL of the tr? I can only retrieve the URI of the element, not its URL.

Any example?
Thanks
Jack
 
Jeanne Boyarsky
author & internet detective
Marshal
Posts: 33700
Table rows don't have URLs. They have table cells (or headers). Can you show a code snippet of what you are trying to parse?
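
To illustrate the point: the URL normally lives on a link or image inside a cell (an href or src attribute), and it is often relative to the page. Here is a minimal sketch, using only the JDK, of turning such an attribute value into an absolute URL. The class name LinkResolver, the method resolve, and the example.com addresses are made up for illustration; pulling the attribute value out of the page is left to whatever parser you use.

```java
import java.net.URI;

class LinkResolver {

    // Resolve an href/src value (as found on an <a> or <img> inside a table cell)
    // against the URL of the page it came from.
    // Assumption: href is already a syntactically valid URI reference; values with
    // spaces or other illegal characters would need encoding first, otherwise
    // URI.resolve(String) throws IllegalArgumentException.
    static String resolve(String pageUrl, String href) {
        return URI.create(pageUrl).resolve(href).toString();
    }

    public static void main(String[] args) {
        // Placeholder page URL, not a real site from this thread.
        String page = "http://example.com/gallery/index.html";

        System.out.println(resolve(page, "../images/full/cat.jpg"));          // relative path
        System.out.println(resolve(page, "/thumbs/cat.jpg"));                 // root-relative path
        System.out.println(resolve(page, "https://cdn.example.org/cat.jpg")); // already absolute
    }
}
```

So the approach would be: find the row, walk to the anchor or image in its cells, read the attribute, then resolve it against the page URL as above.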
 