aspose file tools*
The moose likes Beginning Java and the fly likes Problem with java reading a webpage Big Moose Saloon
  Search | Java FAQ | Recent Topics | Flagged Topics | Hot Topics | Zero Replies
Register / Login


Win a copy of The Java EE 7 Tutorial Volume 1 or Volume 2 this week in the Java EE forum
or jQuery UI in Action in the JavaScript forum!
JavaRanch » Java Forums » Java » Beginning Java
Bookmark "Problem with java reading a webpage" Watch "Problem with java reading a webpage" New topic
Author

Problem with java reading a webpage

Colin A Thompson
Greenhorn

Joined: Dec 08, 2009
Posts: 2
I am having a problem with copying a webpage to a text file. Every time I run my program for this one site the text that gets copied has Asian characters. There are no Asian characters on the page.

I tried my code on other websites and it works fine. Are there security measures that prevent a web page from being copied?

The website I am having problems with is public information and I am not reselling anything of theirs.
Paul Clapham
Bartender

Joined: Oct 14, 2005
Posts: 18541
    
    8

Possibly you are using the wrong charset to convert the downloaded data from bytes to chars. That's just my first guess, though, I'm sure there could be dozens of other things wrong. You don't provide many details for us to comment on.
Campbell Ritchie
Sheriff

Joined: Oct 13, 2005
Posts: 38412
    
  23
Also some text editors or terminal windows may be only able to display ASCII or Latin-1 characters.
 
I agree. Here's the link: http://aspose.com/file-tools
 
subject: Problem with java reading a webpage