Hi guys ..
i want to build a dsah board after fetching data from a website.
means seeing a website page content i want to develop an .xml file which stores tha data of that web page and that .xml file is to be used to generate a dashboard to generate a report.
say a web page is displaying various information on movies.....say it's revenue, it's production cost and all
Now i want to store the top 10 movies from that page (on any selective criteria say ..revenue) in an .xml file. that xml file willl begenerated at my machine...and will be send to create a dashboard report.
So my question is is it possible....i mean how can i read a web page content and store it in a .xml file..and if it is posssible what is the way to do this....and if it is possible how can we create that .xml file by reading the web page..
please help me with suitable way..
There's no natural mapping from an HTML page to XML, so you'll need to code that yourself. I'd approach this using a library like HtmlUnit that makes it easy to access a web site programmatically. It cleans the HTML so it becomes well-formed XML, and then presents a DOM and XPath interface that you can use to extract whichever parts of the page you're interested in.
Joined: Mar 26, 2009
thanks for your response.
well you said that we can use library like HtmlUnit...so is this library already present or we need to create it.
or is there any tool to read the contents of a web page .