Granny's Programming Pearls
"inside of every large program is a small program struggling to get out"
JavaRanch.com/granny.jsp
The moose likes Java in General and the fly likes Java as Filtering Web Content Big Moose Saloon
  Search | Java FAQ | Recent Topics | Flagged Topics | Hot Topics | Zero Replies
Register / Login
JavaRanch » Java Forums » Java » Java in General
Bookmark "Java as Filtering Web Content" Watch "Java as Filtering Web Content" New topic
Author

Java as Filtering Web Content

J. Insi
Ranch Hand

Joined: Aug 11, 2010
Posts: 90

greetings everyone!

does anyone have ever do filtering the web content using java app?
I thought that this could be used, when we apply some socketserver into our localhost itself.
And with another socket to connect to the real web domain (address).

(take note: that our machine is connected to the internet line).

And...
The problem arose when the bytes received from the real web domain (address) translated from
inputstream... which makes the loading ... output appearance at our browser seems slow.

Does anyone may give me a bit clue to overcome this problem?

* NB: Sorry, i didnt mean to duplicate the thread (from Socket section).
Carey Brown
Ranch Hand

Joined: Nov 19, 2001
Posts: 174

You might try Googling "proxy" for some ideas and source code.

The HTTP protocol has become complicated over the years in favor of higher throughput. I suspect that you're doing one of the following:
  • not making use of stay-open connections (i.e. re-connecting each time)

  • not handling "chunking" properly

  • waiting for all the bytes to arrive before outputting any filtered data to the client

  • you have a very inefficient filtering algorithm

  • you are not using threads for parallel processing of multiple streams.


J. Insi
Ranch Hand

Joined: Aug 11, 2010
Posts: 90
well, thanks for the respond, Carey Brown.
I did searching through Google. And I already did, get my app running.

when you said,
not making use of stay-open connections (i.e. re-connecting each time)
you are not using threads for parallel processing of multiple streams.

I'm using a Socket instances within a Thread class.

and,
you have a very inefficient filtering algorithm
waiting for all the bytes to arrive before outputting any filtered data to the client


That's rite, my algorithm is inefficient. Probably yes. That's why I need some evaluation.
but is it okay If I posted up the
source code I written here? (it's just 2 class(file) only).
J. Insi
Ranch Hand

Joined: Aug 11, 2010
Posts: 90
hallo? is there anybody here....?
Lester Burnham
Rancher

Joined: Oct 14, 2008
Posts: 1337
If this was my problem, I'd start with an existing proxy solution -such as Muffin, but possibly a bit less dated- and would try to adapt it to my purposes.
J. Insi
Ranch Hand

Joined: Aug 11, 2010
Posts: 90
Lester Burnham wrote:If this was my problem, I'd start with an existing proxy solution -such as Muffin, but possibly a bit less dated- and would try to adapt it to my purposes.


ya, i guess so.
i think i should resolved this thread.

thanks guys!
 
I agree. Here's the link: http://aspose.com/file-tools
 
subject: Java as Filtering Web Content