File APIs for Java Developers
Manipulate DOC, XLS, PPT, PDF and many others from your application.
The moose likes Java in General and the fly likes java exercise help Big Moose Saloon
  Search | Java FAQ | Recent Topics | Flagged Topics | Hot Topics | Zero Replies
Register / Login

Win a copy of Soft Skills this week in the Jobs Discussion forum!
JavaRanch » Java Forums » Java » Java in General
Bookmark "java exercise help" Watch "java exercise help" New topic

java exercise help

henry dias

Joined: Mar 22, 2011
Posts: 2
Find the 10 most common and 10 least common words in the textfile.
Find the 10 most common and 10 least common bigrams in the file. (A bigram is two words following each other in the text. (F.ex. "The red fox" contains 2 bigrams: "The red" and "red fox". - See )
Find the longest phrase that also appears at least twice.
("longest phrase" here means the number of words, NOT letters)

Henry Wong

Joined: Sep 28, 2004
Posts: 19061

Please tell us what you have done so far. Please tell us *exactly* what issue you are running into. We can't help you if we don't know where you are stuck.


Books: Java Threads, 3rd Edition, Jini in a Nutshell, and Java Gems (contributor)
henry dias

Joined: Mar 22, 2011
Posts: 2
The issue i am facing is for correct regexp

i am getting words as citizen. citzen, and citizen; i want to remove the , . ; after words and then check for thier repetition in file .

Rob Spoor

Joined: Oct 27, 2005
Posts: 19783

Please UseCodeTags next time. I've added them for you this time.

How To Ask Questions How To Answer Questions
Campbell Ritchie

Joined: Oct 13, 2005
Posts: 40051
Welcome to the Ranch

I do not think a regular expression will help you at all. Go through the Java™ Tutorials Collections section and you will find a counting application example.
I agree. Here's the link:
subject: java exercise help