
How to learn Text mining in Java?

Ashish Dutt
Ranch Hand

Joined: Apr 12, 2013
Posts: 172

Dear all,
I want to learn text mining in Java.
Please suggest a good book or tutorial at the beginner level.
I have already gone through books like Data Mining: Practical Machine Learning Tools and Techniques, Third Edition (The Morgan Kaufmann Series in Data Management Systems) and Mining the Social Web: Analyzing Data from Facebook, Twitter, LinkedIn, and Other Social Media Sites by Matthew A. Russell, but honestly they did not make much sense to me. I would like a book or resource that starts from ground zero and uses Java.
Also, which open source framework would be good to use here?
I am using the NetBeans 7.3 IDE, so if you suggest an open source framework, please also tell me how to integrate it with NetBeans.

Still a beginner at crafting quality code, but I have the zeal to learn more. Your help would be appreciated.
My Blog on Learning Analytics
Campbell Ritchie
Sheriff

Joined: Oct 13, 2005
Posts: 39478
    
Don't know. But I do know that this question is too difficult for “beginning”, so I shall move it.
Ashish Dutt
Ranch Hand

Joined: Apr 12, 2013
Posts: 172

Well, if you think so, then please do move it to the appropriate forum, but do let me know where you're moving it to.
Thanks
William Brogden
Author and all-around good cowpoke
Rancher

Joined: Mar 22, 2000
Posts: 12809
    
Well, your first steps will involve learning how to parse text into units that make semantic sense.

In addition to the obvious need to parse individual words and track their usage, such as frequency counts, you will want to be able to recognize sentences and paragraphs and extract information such as the association of key words within a sentence.
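As a rough illustration of those first steps, here is a minimal sketch that uses only the JDK. It is not a recommended framework, just one way to try the ideas: the class name `TextStats` and its method names are made up for this example, the word splitter assumes plain English text, and sentence detection relies on `java.text.BreakIterator`, which can be fooled by abbreviations.

```java
import java.text.BreakIterator;
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

// Hypothetical helper class for illustration only.
class TextStats {

    // Split text into sentences with the JDK's locale-aware BreakIterator.
    static List<String> sentences(String text) {
        List<String> result = new ArrayList<>();
        BreakIterator it = BreakIterator.getSentenceInstance(Locale.ENGLISH);
        it.setText(text);
        int start = it.first();
        for (int end = it.next(); end != BreakIterator.DONE; start = end, end = it.next()) {
            String sentence = text.substring(start, end).trim();
            if (!sentence.isEmpty()) {
                result.add(sentence);
            }
        }
        return result;
    }

    // Lower-case the text and split on anything that is not a letter or apostrophe.
    static List<String> words(String text) {
        List<String> result = new ArrayList<>();
        for (String w : text.toLowerCase(Locale.ENGLISH).split("[^a-z']+")) {
            if (!w.isEmpty()) {
                result.add(w);
            }
        }
        return result;
    }

    // Count how often each word occurs, keeping first-seen order.
    static Map<String, Integer> frequencies(String text) {
        Map<String, Integer> counts = new LinkedHashMap<>();
        for (String w : words(text)) {
            Integer old = counts.get(w);
            counts.put(w, old == null ? 1 : old + 1);
        }
        return counts;
    }

    // Count the sentences in which both key words appear together.
    static int coOccurrences(String text, String a, String b) {
        int n = 0;
        for (String sentence : sentences(text)) {
            List<String> ws = words(sentence);
            if (ws.contains(a) && ws.contains(b)) {
                n++;
            }
        }
        return n;
    }

    public static void main(String[] args) {
        String text = "Java is popular. Text mining in Java needs parsing. "
                + "Parsing text is the first step.";
        System.out.println(sentences(text));
        System.out.println(frequencies(text));
        System.out.println(coOccurrences(text, "text", "parsing"));
    }
}
```

The code sticks to Java 7 syntax, so it should compile as a plain Java application project in NetBeans 7.3 without any extra libraries.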

How far have you gotten in terms of the pure mechanics of parsing?

Bill
Ashish Dutt
Ranch Hand

Joined: Apr 12, 2013
Posts: 172

William Brogden wrote: Well, your first steps will involve learning how to parse text into units that make semantic sense.

How far have you gotten in terms of the pure mechanics of parsing?

Bill

Thanks, William, for that info, but I already know that. I was previously using Python for text mining. The NLTK toolkit by Steven Bird is very useful, so I developed a small prototype program in Python for the first step of parsing text.
It was not great; it would merely read an Excel file, clean the text and print it.
I really liked Python a lot because of its robustness, but I had to switch over to Java for the better.
So my understanding of this first step, which involves parsing text, is rudimentary at the moment.
I have used Collections, typically ArrayList and HashMap, to clean data from a text file, but that cleaning is not good enough.
Please refer to my blog, where I have posted the source code for this activity.
I would like to discuss this more with you, for your reply was well informed.
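
For readers following along, the kind of cleaning step described above might look roughly like the sketch below. This is not the code from the blog. The class name `TextCleaner`, the tiny stop-word list and the rule of keeping only the letters a-z are all assumptions chosen for illustration, and reading the text file or Excel sheet is left out.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Set;

// Hypothetical helper class for illustration only.
class TextCleaner {

    // Tiny illustrative stop-word list; a real project would load a fuller one.
    private static final Set<String> STOP_WORDS =
            new HashSet<>(Arrays.asList("a", "an", "the", "is", "of", "and", "to", "in"));

    // Lower-case a line, replace everything except a-z and whitespace with
    // spaces, drop stop words and join the remaining tokens with single spaces.
    static String cleanLine(String line) {
        String lettersOnly = line.toLowerCase(Locale.ENGLISH).replaceAll("[^a-z\\s]", " ");
        StringBuilder sb = new StringBuilder();
        for (String token : lettersOnly.trim().split("\\s+")) {
            if (token.isEmpty() || STOP_WORDS.contains(token)) {
                continue;
            }
            if (sb.length() > 0) {
                sb.append(' ');
            }
            sb.append(token);
        }
        return sb.toString();
    }

    // Clean every line and skip lines that end up empty.
    static List<String> cleanAll(List<String> lines) {
        List<String> cleaned = new ArrayList<>();
        for (String line : lines) {
            String c = cleanLine(line);
            if (!c.isEmpty()) {
                cleaned.add(c);
            }
        }
        return cleaned;
    }

    public static void main(String[] args) {
        List<String> raw = Arrays.asList(
                "  The QUICK, brown fox!! ", "12345", "Mining of text & data.");
        for (String line : cleanAll(raw)) {
            System.out.println(line);
        }
    }
}
```

Keeping only a-z is a deliberately blunt rule: it discards digits and accented characters, so it would need loosening for anything other than simple English text.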
 