Hi guys, I am developing a finance application that needs to fetch rates (forex, etc.) from various banks each day. The banks publish their rates on their websites. My Java application uses a plain BufferedInputStream from the java.io library to read the HTML. Most websites work. Unfortunately, one of them (www.pbebank.com) refuses to respond to my program. Even after adding a Mozilla User-Agent to the headers of my URLConnection, I still cannot get the data. I suspect the server is still rejecting my application because the headers I send do not match what it accepts. They are rejecting spiders/crawlers. I would appreciate help from anyone who knows how to get around this. My code is as below.
The error is that the function takes a long time to run, and after a while it fails with:
java.net.SocketException: Software caused connection abort: recv failed
Subliner Kemp wrote: They are rejecting spiders/crawlers....
Your application is a crawler, and they are blocking crawlers. Have you read their terms of service to ensure that what you are doing is allowed? Either way, if they block you then there isn't much you can do about it.
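If their terms of service do permit automated access, it may still be worth ruling out the request itself as the cause. Below is a minimal sketch, not a confirmed fix for this site: it sends a fuller set of browser-like headers and sets explicit timeouts so the call fails quickly instead of hanging. The class name RateFetcher, the header values, the 10-second timeout and the UTF-8 charset are all assumptions for illustration, and the server may still refuse the connection.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.InputStreamReader;
import java.net.HttpURLConnection;
import java.net.URL;
import java.nio.charset.StandardCharsets;

// Hypothetical helper; names and values are illustrative assumptions.
final class RateFetcher {
    // Assumed browser-like User-Agent string; any realistic value could be used.
    static final String USER_AGENT = "Mozilla/5.0 (Windows NT 10.0; Win64; x64)";
    // Assumed timeout so a silent server does not block the caller indefinitely.
    static final int TIMEOUT_MS = 10_000;

    // Prepares the connection without opening the socket yet.
    static HttpURLConnection open(String address) throws IOException {
        HttpURLConnection conn = (HttpURLConnection) new URL(address).openConnection();
        conn.setRequestProperty("User-Agent", USER_AGENT);
        conn.setRequestProperty("Accept", "text/html,application/xhtml+xml");
        conn.setRequestProperty("Accept-Language", "en-US,en;q=0.8");
        conn.setConnectTimeout(TIMEOUT_MS);
        conn.setReadTimeout(TIMEOUT_MS);
        return conn;
    }

    // Connects and reads the whole response body as text (UTF-8 is assumed).
    static String fetch(String address) throws IOException {
        HttpURLConnection conn = open(address);
        try (BufferedReader in = new BufferedReader(
                new InputStreamReader(conn.getInputStream(), StandardCharsets.UTF_8))) {
            StringBuilder body = new StringBuilder();
            String line;
            while ((line = in.readLine()) != null) {
                body.append(line).append('\n');
            }
            return body.toString();
        } finally {
            conn.disconnect();
        }
    }
}
```

Calling RateFetcher.fetch with the page URL inside a try/catch for IOException would show whether the request now succeeds or times out. If it still ends in a SocketException, the block is probably not based on the headers alone, and asking the bank whether they offer a data feed is the more reliable route.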