aspose file tools*
The moose likes Java in General and the fly likes How To Clean Up All The Formats Of The Web Pages On The Internet? Big Moose Saloon
  Search | Java FAQ | Recent Topics | Flagged Topics | Hot Topics | Zero Replies
Register / Login


Win a copy of EJB 3 in Action this week in the EJB and other Java EE Technologies forum!
JavaRanch » Java Forums » Java » Java in General
Bookmark "How To Clean Up All The Formats Of The Web Pages On The Internet?" Watch "How To Clean Up All The Formats Of The Web Pages On The Internet?" New topic
Author

How To Clean Up All The Formats Of The Web Pages On The Internet?

JiaPei Jen
Ranch Hand

Joined: Nov 19, 2000
Posts: 1309
The web pages that we see over the internet is often formatted; i.e. in tables, with fonts, etc.
Is there a way to clean up all the formats and print those pages in plain text? Take the interfaces and classes of Java 1.4 API for example, I am trying to read them and print
method, parameter, return type, etc.
in plain text using Java. Where may I find explanatin on how to do it?
[ January 02, 2003: Message edited by: JiaPei Jen ]
Arun Boraiah
Ranch Hand

Joined: Nov 28, 2001
Posts: 233
Simple way is copy past the content to plan text editor like notepad. And print it.
-arun


Sharing is learning
JiaPei Jen
Ranch Hand

Joined: Nov 19, 2000
Posts: 1309
I have a lot of this kind of cleanup work to do. I cannot afford to "copy and paste" by hand. This is the reason I would like to write a Java program to do it. Please if anybody could give me the guidance.
Jim Yingst
Wanderer
Sheriff

Joined: Jan 30, 2000
Posts: 18671
Try looking at JTidy.


"I'm not back." - Bill Harding, Twister
 
I agree. Here's the link: http://aspose.com/file-tools
 
subject: How To Clean Up All The Formats Of The Web Pages On The Internet?
 
Similar Threads
to view multiple pages of a file or document
so what's next for the learning curve?
man to cat or something
Containers/Deployment
IE and NN Compability