Welcome

I am a Bay Area Software and Systems Architect, the founder of Concurrent, Inc., and the author of Cascading.

Cascading is an application library for rapidly developing highly complex data processing workflows.

Please feel free to contact me regarding any projects needing experienced help in data and text mining, content management, business process management, and distributed application integration.

Specifically I can provide expertise in deploying clustered data processing applications using Apache Hadoop and running on Amazon EC2.

Some past portfolio work:
Text mining over large data sets

Above, dusk in Prague.