i want to scan a web page to identify frequently occurring strings and how can i form a graph from the html,xml or text file with labels on the graph nodes.
Individual words would be pretty easy. Make a map where key is a word and value is a counter.
Phrases would be a tougher. I'm not sure how to define a "meaningful phrase" that's worth tracking vs random combinations of words.
Does the map thing sound like something you could use?
A good question is never answered. It is not a bolt to be tightened into place but a seed to be planted and to bear more seed toward the hope of greening the landscape of the idea. John Ciardi
Which part are you struggling with - downloading the web page (or opening a file), analysing the text and creating a map of words, creating the graph, something else, ... ? What do you have so far?
My favorite library for this task would be JChart2D, or cewolf/JFreeChart if the chart is supposed to be displayed in a web page. Links to all these can be found in the http://faq.javaranch.com/java/OtherOpenSourceProjectsFaq
I brought this back from the farm where they grow the tiny ads:
a bit of art, as a gift, that will fit in a stocking