File APIs for Java Developers
Manipulate DOC, XLS, PPT, PDF and many others from your application.
http://aspose.com/file-tools
The moose likes Java in General and the fly likes About similarity algorithm Big Moose Saloon
  Search | Java FAQ | Recent Topics
Register / Login


JavaRanch » Java Forums » Java » Java in General
Reply Bookmark "About similarity algorithm" Watch "About similarity algorithm" New topic
Author

About similarity algorithm

Mukunda s Dhakate
Greenhorn

Joined: Oct 29, 2007
Posts: 12
Hello friends,
I am trying to check similarity of two documents or two sentances.
I am using TF-IDF and cosine similarity check for this.
But in some cases it does not work.
for example
"Train is delayed"
and
"Train is running late"
these two sentances are similar but we only can recognize this,
using cosine similarity algorithm we cannot get the required result.
Is there any other similarity algorithm which will also work for such cases,
or could any one help me on this case.
Please help me on this.

Thank you
Regards
Mukunda
 
I agree. Here's the link: http://aspose.com/file-tools
 
subject: About similarity algorithm
 
Similar Threads
Need Thread Handling Algorithm
Compare two text
How does google know when you've made a typo? (serious question)
vi
Punction Correct (Sentances)