mini-parsers

Mini-Parsers is a Java API for parsing short discrete text strings into native types where a single type may have multiple textual representations.

This is useful for parsing command line argument values piped or pasted from different sources, data entry validation, or normalizing data in a column during data cleansing and ETL.

For example, the same instant in time (java.time.Instant) may have multiple formats. The two strings 1423526400000 and 2015-02-10T00:00:00+00:00 are equivalent if the first is interpreted as the milliseconds since the epoch.
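To make the equivalence concrete, here is a minimal sketch that uses only the standard java.time classes, not the Mini-Parsers API. The class InstantNormalizer and its parse method are hypothetical names chosen for illustration, and the sketch assumes just two input shapes: an all-digit string read as epoch milliseconds, and an ISO-8601 date-time with an offset.

```java
import java.time.Instant;
import java.time.OffsetDateTime;

// Hypothetical helper for illustration only; this is NOT the Mini-Parsers API.
public class InstantNormalizer {

    // Normalizes one of two assumed textual forms into a java.time.Instant.
    static Instant parse(String text) {
        String trimmed = text.trim();

        // All digits: interpret the value as milliseconds since the epoch.
        if (trimmed.matches("\\d+")) {
            return Instant.ofEpochMilli(Long.parseLong(trimmed));
        }

        // Otherwise assume an ISO-8601 date-time with an offset,
        // e.g. 2015-02-10T00:00:00+00:00.
        return OffsetDateTime.parse(trimmed).toInstant();
    }

    public static void main(String[] args) {
        Instant fromMillis = parse("1423526400000");
        Instant fromIso = parse("2015-02-10T00:00:00+00:00");

        // Both strings denote midnight UTC on 10 February 2015.
        System.out.println(fromMillis.equals(fromIso)); // prints true
    }
}
```

A hand-written chain of checks like this grows quickly once more formats are involved, which is the case a dedicated parsing library is meant to cover.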

Out of the box, Mini-Parsers will normalize dozens of time formats.

This project relies on the Parboiled PEG library.

Chris K Wensel
Data and Analytics Architect