Win a copy of Mesos in Action this week in the Cloud/Virtualizaton forum!
  • Post Reply
  • Bookmark Topic Watch Topic
  • New Topic

Does anyone have experience with htmlparser that can help me out?

 
Jacky Luk
Ranch Hand
Posts: 634
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator

java.io.IOException: Invalid Http response

The link here is dynamic, with tokens, how do I parse some web site using htmlparser?

http://htmlparser.sourceforge.net/

Thanks in advance
Jack
 
Ulf Dittmer
Rancher
Posts: 42967
73
  • Likes 1
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
That library hasn't been maintained for many years. If this was my problem I'd switch to a current library like HtmlUnit.
 
Jacky Luk
Ranch Hand
Posts: 634
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
Ulf Dittmer wrote:That library hasn't been maintained for many years. If this was my problem I'd switch to a current library like HtmlUnit.


Thanks Ulf, I'll take a look at that one.
Jack
 
Jacky Luk
Ranch Hand
Posts: 634
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
I am just wondering how to capture Thumbelina (an image linked by this little thumbnail) of google with htmlunit.



Just wondering how to get the url of the tr, I can only retrieve the URI, not URL of the element.

Any example?
Thanks
Jack
 
Jeanne Boyarsky
author & internet detective
Marshal
Posts: 34422
347
Eclipse IDE Java VI Editor
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
Table rows don't have URLs. They have table cells (or headers.) Can you show a code snippet of what you are trying to parse?
 
  • Post Reply
  • Bookmark Topic Watch Topic
  • New Topic