aspose file tools*
The moose likes Beginning Java and the fly likes parsing quoted text Big Moose Saloon
  Search | Java FAQ | Recent Topics | Flagged Topics | Hot Topics | Zero Replies
Register / Login
JavaRanch » Java Forums » Java » Beginning Java
Bookmark "parsing quoted text" Watch "parsing quoted text" New topic
Author

parsing quoted text

ronald ali mangaliag
Greenhorn

Joined: Feb 05, 2004
Posts: 18
i wanted to parse a text that is similar to the one below:

firstname lastname age birthday

the first name may be enclosed in quotes as in "ronald ali" and that is true also for the lastname... the age should be numeric and birthday should be a valid bday with this format yyyy-MM-dd...

i used Stringtokenizer... but the problem is, the dashes (-) in birthday is treated as a character...

saving each token on a List makes the size of the same to 8 instead of only 4 (pertaining to the four fields)....

what do you think should i do? do you have a solution? even if it doesnt use stringtokenizer... i was thinking of using string.split() but am not well versed in regular expression... please advice....

thank you

ali
Paul Sturrock
Bartender

Joined: Apr 14, 2004
Posts: 10336

Well, StringTokenizer will work fine to tokenize that String into the four elements you want. You might have to show us your code and perhaps we can see what you are doing wrong.

There are other ways to do this though. You could (as you have noticed) use the split() method of the String class. Strictly you would need to use the "any whitespace" symbol in your regex ([\s]), but since your regex is so simple, just using a space in the split method will work. If you are still unsure of regex's, you could treat the String as a char [] and process each character at a time.


JavaRanch FAQ HowToAskQuestionsOnJavaRanch
Scheepers de Bruin
Ranch Hand

Joined: Jul 19, 2005
Posts: 99
Ok suppose you get a string that contains the following:
String input = "\"firstname\" lastname age birthday";
(the \" is how you 'escape' the double quote character, i.e. how you tell java to stick a double quote character in a string without interpreting it as a String delimiter)

You can use the StringTokenizer like this:
StringTokenizer st = new StringTokenizer(input, " ");
(Using space as the delimiter)

Or you can use the split method:
String[] params= input.split(" ");


We're doomed!!<br />Yay!!!<br />No that's bad Girr!!<br />Yay!!!
Stan James
(instanceof Sidekick)
Ranch Hand

Joined: Jan 29, 2003
Posts: 8791
Did any of those tips help with the embedded blank in \"Ronald Ali\" ?

I drag a lot of bad habits from my pre-Java days, but I'd probably get one token at a time from the string the hard way with a "cursor" or position in the string:

You can smarten this up with regular expressions ... maybe one that will match the first quoted string (allowing blanks inside) OR the next unquoted string up to a blank or end of input.
[ August 31, 2005: Message edited by: Stan James ]

A good question is never answered. It is not a bolt to be tightened into place but a seed to be planted and to bear more seed toward the hope of greening the landscape of the idea. John Ciardi
 
I agree. Here's the link: http://aspose.com/file-tools
 
subject: parsing quoted text
 
Similar Threads
Searching Strings in a textfile
Tokenizing after reading from a file
stingTokenizer
Beginning code - what am I missing?
Java code to append data into a existing xml file