Split Big XML into smaller ones by size

GaneshSg Sundaresan
Greenhorn

Joined: Oct 17, 2012
Posts: 2
Hello Everyone,

I need your suggestions on how to split a big XML file into smaller ones by size, dynamically, in Java.

Before we start parsing the XML, we need to check the file size and split the file so that each piece is no more than 10 MB.

Also, each of the resulting XML files should be well-formed.

If you are aware of any better framework / better way of doing this please let me know.

Thanks,
Ganesh
Paul Clapham
Bartender

Joined: Oct 14, 2005
Posts: 18989

It's easy enough to split a file into smaller pieces, but splitting an XML file into smaller pieces which are themselves well-formed XML files? Can't be done.

But perhaps your XML documents have some specific features which allow you to split them into a set of sub-documents which, with little modification, could be made into well-formed XML files?

As for doing the split before you start parsing, I wouldn't recommend that. If your idea of a split is even possible, it should be done by software which understands the rules of XML, and that means a parser.
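
To make the "let a parser do the split" idea concrete, here is a minimal sketch using StAX (javax.xml.stream), which ships with the JDK. It assumes the document is one root element wrapping many repeated child elements ("records"), which may not match your data. The class name XmlSplitter, the split method and the part-0001.xml file naming are made up for illustration. Each part gets a copy of the root start tag (with its attributes and namespace declarations) and its own closing tag, so each part is well-formed. The size check happens only between records, so a part can overshoot maxBytes by up to one record plus the closing root tag; pass a limit somewhat below 10 MB if the cap is strict. Text and whitespace directly under the root are dropped.

```java
import java.io.FilterOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.OutputStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.List;
import javax.xml.namespace.QName;
import javax.xml.stream.XMLEventFactory;
import javax.xml.stream.XMLEventReader;
import javax.xml.stream.XMLEventWriter;
import javax.xml.stream.XMLInputFactory;
import javax.xml.stream.XMLOutputFactory;
import javax.xml.stream.XMLStreamException;
import javax.xml.stream.events.StartElement;
import javax.xml.stream.events.XMLEvent;

// Hypothetical helper, not part of any library.
public class XmlSplitter {

    /** Counts the bytes that have been pushed through to the part file. */
    static final class CountingStream extends FilterOutputStream {
        long count;
        CountingStream(OutputStream out) { super(out); }
        @Override public void write(int b) throws IOException { out.write(b); count++; }
        @Override public void write(byte[] b, int off, int len) throws IOException {
            out.write(b, off, len);
            count += len;
        }
    }

    private static final XMLEventFactory EVENTS = XMLEventFactory.newInstance();

    /** Splits input into part files under outDir; assumes root -> repeated records. */
    public static List<Path> split(Path input, Path outDir, long maxBytes)
            throws IOException, XMLStreamException {
        List<Path> parts = new ArrayList<>();
        StartElement root = null;
        CountingStream out = null;
        XMLEventWriter writer = null;
        int depth = 0;
        try (InputStream in = Files.newInputStream(input)) {
            XMLEventReader reader = XMLInputFactory.newInstance().createXMLEventReader(in);
            while (reader.hasNext()) {
                XMLEvent event = reader.nextEvent();
                if (event.isStartElement()) {
                    depth++;
                    if (depth == 1) { root = event.asStartElement(); continue; }
                    // A new record is starting: rotate if the current part is full.
                    if (depth == 2 && writer != null && out.count >= maxBytes) {
                        closePart(writer, out, root);
                        writer = null;
                    }
                    if (writer == null) {
                        Path part = outDir.resolve(String.format("part-%04d.xml", parts.size() + 1));
                        parts.add(part);
                        out = new CountingStream(Files.newOutputStream(part));
                        writer = XMLOutputFactory.newInstance().createXMLEventWriter(out, "UTF-8");
                        writer.add(EVENTS.createStartDocument("UTF-8", "1.0"));
                        writer.add(root); // copy of the original root start tag
                    }
                    writer.add(event);
                } else if (depth >= 2) {
                    writer.add(event); // anything inside a record is copied as-is
                    if (event.isEndElement()) {
                        if (depth == 2) writer.flush(); // make the byte count current
                        depth--;
                    }
                } else if (event.isEndElement()) {
                    depth--; // end of the root element
                }
            }
            reader.close();
            if (writer != null) closePart(writer, out, root);
        } finally {
            if (out != null) out.close(); // harmless if the part was already closed
        }
        return parts;
    }

    private static void closePart(XMLEventWriter writer, OutputStream out, StartElement root)
            throws IOException, XMLStreamException {
        QName name = root.getName();
        writer.add(EVENTS.createEndElement(
                name.getPrefix(), name.getNamespaceURI(), name.getLocalPart()));
        writer.add(EVENTS.createEndDocument());
        writer.close();
        out.close();
    }
}
```

A call would look like XmlSplitter.split(Paths.get("big.xml"), Paths.get("out"), 9L * 1024 * 1024), where both paths are placeholders. Because the input is streamed event by event, the whole document is never held in memory.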
GaneshSg Sundaresan
Greenhorn

Joined: Oct 17, 2012
Posts: 2
Thanks Paul.

Are you aware of any framework that does that?

Thanks,
Ganesh
Harikumar Maryadha
Greenhorn

Joined: Nov 07, 2012
Posts: 1
Hi Ganesh,

I have the same requirement. Did you find out how to do it?

Thanks,
Hari