
Issue with Web Harvest removing spaces after closing tags

Ajay Dhar
Ranch Hand

Joined: Jan 26, 2011
Posts: 30
How do I prevent Web Harvest from removing the space after closing tags when I convert HTML to XML? My configuration file is shown below:



I'm using Web Harvest to extract the paragraphs (<p></p>) from an HTML page, but Web Harvest removes the space that follows closing tags such as </b> and </a>. When I then strip the HTML tags from the Web Harvest output with JSoup, there is no space between the text of a link and the word that follows it. The same happens with text that was in bold.
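
To show the symptom in isolation, here is a minimal sketch in plain Java. It does not call Web Harvest or JSoup: the two strings are made-up examples of a paragraph before and after the space is lost, and the regex-based stripTags helper is only a crude stand-in for the tag removal I do with JSoup. The class and method names are hypothetical.

```java
import java.util.regex.Pattern;

public class SpaceCheck {
    // Matches anything that looks like a tag; a crude stand-in for JSoup's tag removal.
    private static final Pattern TAG = Pattern.compile("<[^>]+>");

    /** Removes tags without adding or removing any other characters. */
    public static String stripTags(String markup) {
        return TAG.matcher(markup).replaceAll("");
    }

    public static void main(String[] args) {
        // Hypothetical paragraph as it appears in the source HTML page.
        String original = "<p>See <a href=\"#\">the docs</a> for <b>more</b> details.</p>";
        // Hypothetical Web Harvest output with the space after </a> and </b> dropped.
        String harvested = "<p>See <a href=\"#\">the docs</a>for <b>more</b>details.</p>";

        System.out.println(stripTags(original));
        System.out.println(stripTags(harvested));
    }
}
```

Because the helper only deletes tags, any word that ends up glued to the previous one must already be missing its space in the markup passed in. That suggests the space is lost during the conversion step and not during tag removal.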


Help is greatly appreciated.


OCPJP 6, OCEEJBD 6, GIAC Secure Software Programmer-Java (GSSP-Java)