java exercise help

henry dias

Joined: Mar 22, 2011
Posts: 2
Find the 10 most common and 10 least common words in the text file.
Find the 10 most common and 10 least common bigrams in the file. (A bigram is two words following each other in the text. For example, "The red fox" contains 2 bigrams: "The red" and "red fox".)
Find the longest phrase that appears at least twice.
("Longest phrase" here is measured by the number of words, NOT letters.)
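
To make the bigram definition concrete, here is a minimal sketch that pairs each word with the one following it. The class and method names (`Bigrams`, `bigrams`) are made up for illustration, and it assumes the text has already been split into a list of words.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

class Bigrams {
    // Hypothetical helper: joins every word with its successor.
    // A list of n words yields n - 1 bigrams (none if n < 2).
    static List<String> bigrams(List<String> words) {
        List<String> result = new ArrayList<>();
        for (int i = 0; i + 1 < words.size(); i++) {
            result.add(words.get(i) + " " + words.get(i + 1));
        }
        return result;
    }

    public static void main(String[] args) {
        // prints [The red, red fox]
        System.out.println(bigrams(Arrays.asList("The", "red", "fox")));
    }
}
```

Once the bigrams are plain strings like this, they could be counted in the same way as single words.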

Henry Wong

Joined: Sep 28, 2004
Posts: 19344

Please tell us what you have done so far. Please tell us *exactly* what issue you are running into. We can't help you if we don't know where you are stuck.


Books: Java Threads, 3rd Edition, Jini in a Nutshell, and Java Gems (contributor)
henry dias

Joined: Mar 22, 2011
Posts: 2
The issue I am facing is finding the correct regexp.

I am getting words such as "citizen.", "citizen," and "citizen;". I want to remove the , . ; after words and then check for their repetition in the file.
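
For what it's worth, one possible way to handle the punctuation described above is to split on whitespace and then trim anything that is not a letter or digit from both ends of each token. This is only a rough sketch: `WordCleaner`, `normalize` and `words` are hypothetical names, and lower-casing the words is an assumption that the exercise itself does not state.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

class WordCleaner {
    // Hypothetical helper: removes leading and trailing characters that are
    // neither letters nor digits, so "citizen." and "citizen;" become "citizen".
    // Lower-casing is an assumption; drop it if case should be significant.
    static String normalize(String token) {
        return token
                .replaceAll("^[^\\p{L}\\p{N}]+|[^\\p{L}\\p{N}]+$", "")
                .toLowerCase(Locale.ROOT);
    }

    // Splits the text on whitespace and keeps only the non-empty cleaned words.
    static List<String> words(String text) {
        List<String> result = new ArrayList<>();
        for (String token : text.split("\\s+")) {
            String word = normalize(token);
            if (!word.isEmpty()) {
                result.add(word);
            }
        }
        return result;
    }

    public static void main(String[] args) {
        // prints [citizen, citizen, and, citizen]
        System.out.println(words("citizen. citizen, and citizen;"));
    }
}
```

Punctuation inside a word, such as the apostrophe in "don't", is left alone because only the ends of each token are trimmed.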

Rob Spoor

Joined: Oct 27, 2005
Posts: 19911

Please UseCodeTags next time. I've added them for you this time.

Campbell Ritchie

Joined: Oct 13, 2005
Posts: 41098
Welcome to the Ranch

I do not think a regular expression will help you at all. Go through the Java™ Tutorials Collections section and you will find a counting application example.
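
As a rough idea of what a Map-based counting approach can look like, here is a small sketch. It is not the tutorial's own code; the names `WordCount`, `count` and `byFrequency` are made up for illustration, and breaking ties alphabetically is an assumption added only to make the ordering deterministic.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

class WordCount {
    // Counts how many times each string occurs in the list.
    static Map<String, Integer> count(List<String> items) {
        Map<String, Integer> counts = new HashMap<>();
        for (String item : items) {
            counts.merge(item, 1, Integer::sum);
        }
        return counts;
    }

    // Returns the entries sorted by descending count.
    // Ties are broken alphabetically (an assumption, for a stable order).
    static List<Map.Entry<String, Integer>> byFrequency(Map<String, Integer> counts) {
        List<Map.Entry<String, Integer>> entries = new ArrayList<>(counts.entrySet());
        entries.sort((a, b) -> {
            int byCount = Integer.compare(b.getValue(), a.getValue());
            return byCount != 0 ? byCount : a.getKey().compareTo(b.getKey());
        });
        return entries;
    }

    public static void main(String[] args) {
        List<String> words = Arrays.asList(
                "the", "cat", "and", "the", "dog", "and", "the", "bird");
        List<Map.Entry<String, Integer>> sorted = byFrequency(count(words));
        int n = Math.min(2, sorted.size());
        // Most common entries come first: prints [the=3, and=2]
        System.out.println(sorted.subList(0, n));
        // Least common entries are at the end: prints [cat=1, dog=1]
        System.out.println(sorted.subList(sorted.size() - n, sorted.size()));
    }
}
```

For the exercise, `n` would be 10 instead of 2, and the same two methods should work for bigrams if each bigram is stored as a single string.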
I agree. Here's the link: