I want to read the HTML content of an external website page (e.g. www.yahoo.com). Could you please give me pointers on where to start?
I want my servlet/container to act as a client, requesting the external website page and receiving the response as HTML/text.
I initially toyed with the idea of forwarding the request to the external site URL and using a Filter to intercept the response and parse it. But the problem with this approach is that we cannot forward the request to a page external to the JVM (I am not sure about this).
InputStream in = new URL(abc).openConnection().getInputStream(); or something like that. Check the API for java.net.URL and java.net.URLConnection. If you need to 'talk HTTP' (headers, cookies, redirects, authentication and so on) then you will need to look at something like Apache's HttpClient. If you want to be responsible, you should look at the behaviour expected of HTTP proxies and act accordingly.
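To make the one-liner above a bit more concrete, here is a rough sketch using only java.net. The class name PageFetcher, the fetch method, the 5 second timeouts and the UTF-8 decoding are all my own assumptions for illustration; a real page may declare a different charset in its Content-Type header, so treat this as a starting point rather than a finished client.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.net.URL;
import java.net.URLConnection;
import java.nio.charset.StandardCharsets;

// Hypothetical helper, not part of any library.
class PageFetcher {

    // Opens a connection to the given address and returns the body as text.
    static String fetch(String address) throws IOException {
        URLConnection connection = new URL(address).openConnection();
        // Assumed timeouts so a slow site cannot block the servlet thread forever.
        connection.setConnectTimeout(5000);
        connection.setReadTimeout(5000);

        // try-with-resources closes the stream even if reading fails.
        try (InputStream in = connection.getInputStream()) {
            ByteArrayOutputStream buffer = new ByteArrayOutputStream();
            byte[] chunk = new byte[8192];
            int n;
            while ((n = in.read(chunk)) != -1) {
                buffer.write(chunk, 0, n);
            }
            // Assumption: the page is UTF-8 encoded.
            return new String(buffer.toByteArray(), StandardCharsets.UTF_8);
        }
    }
}
```

From a servlet you would call something like PageFetcher.fetch("http://www.yahoo.com/") inside doGet and then parse or write out the returned string. No forwarding or Filter is involved, because the servlet itself is acting as the HTTP client.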