File APIs for Java Developers
Manipulate DOC, XLS, PPT, PDF and many others from your application.
http://aspose.com/file-tools
The moose likes Android and the fly likes Information Extraction Big Moose Saloon
  Search | Java FAQ | Recent Topics
Register / Login
JavaRanch » Java Forums » Mobile » Android
Reply Bookmark "Information Extraction" Watch "Information Extraction" New topic
Author

Information Extraction

deepika deepi
Ranch Hand

Joined: Jan 23, 2012
Posts: 105
hi experts,

i need some java coding for Information Extraction in Java for tourism or any personality name it should give me the details regarding it on the output screen
if i have an paragraph of documents like this

October 14, 2002, 4:00 a.m. PT
For years, Microsoft Corporation CEO Bill Gates
railed against the economic philosophy of opensource
software with Orwellian fervor, denouncing
its communal licensing as a "cancer" that stifled
technological innovation.
Today, Microsoft claims to "love" the open-source
concept, by which software code is made public to
encourage improvement and development by
outside programmers. Gates himself says Microsoft
will gladly disclose its crown jewels--the coveted
code behind the Windows operating system--to
select customers.
"We can be open source. We love the concept of
shared source," said Bill Veghte, a Microsoft VP.
"That's a super-important shift for us in terms of
code access.“
Richard Stallman, founder of the Free Software
Foundation, countered saying



AND IT SHOULD EXTRACT THE OUTPUT.Eg:

October 14, 2002, 4:00 a.m. PT
For years, Microsoft Corporation CEO Bill Gates
railed against the economic philosophy of opensource
software with Orwellian fervor, denouncing
its communal licensing as a "cancer" that stifled
technological innovation.
Today, Microsoft claims to "love" the open-source
concept, by which software code is made public to
encourage improvement and development by
outside programmers. Gates himself says Microsoft
will gladly disclose its crown jewels--the coveted
code behind the Windows operating system--to
select customers.
"We can be open source. We love the concept of
shared source," said Bill Veghte, a Microsoft VP.
"That's a super-important shift for us in terms of
code access.“
Richard Stallman, founder of the Free Software
Foundation, countered saying

then the output will be high lighting the output window on the screen

NAME TITLE ORGANIZATION
Bill Gates CEO Microsoft
Bill Veghte VP Microsoft
Richard Stallman founder Free Software


can any one help me out to get this output ?



Hardik Trivedi
Ranch Hand

Joined: Jan 30, 2010
Posts: 252
Well you must have to maintain database for the keywords. Read the file sequentially, then you need to check for every keywords for every line.

It is bit of tough though. to form the information which you want to build. But as of now you can start using simple string manipulation and file reading.
 
 
subject: Information Extraction
 
Threads others viewed
So Microsoft tries to recruit Eric Raymond ...
Do you know how these famous names were formed?
Commercial? Open Source? Which one is Big?
[ ANN ] MerchantSpace Commerce 3.0, enterprise e-Commerce Software with a SOA
O'Reilly Conference
MyEclipse, The Clear Choice

cast iron skillet 49er

more from paul wheaton's glorious empire of web junk: cast iron skillet diatomaceous earth rocket mass heater sepp holzer raised garden beds raising chickens lawn care CFL flea control missoula heat permaculture