Our Mission

What would data analytics infrastructure (namely Hadoop) look like if we rebuilt it from scratch today? We think it would be containerized, modular, and easy enough for a single person to use while still being scalable enough for a whole company. Tools like Docker and Kubernetes provide the perfect building blocks for us to revolutionize data infrastructure!

Pachyderm is “Git for Data Science.” We offer complete version control for data and give your data science team the same first-class development tools that software developers have. Pachyderm is ideal for building machine learning pipelines and ETL workflows because we trace every model and output directly back to the raw input datasets that produced it (a.k.a. provenance).

Since everything in Pachyderm is a container, data scientists can use any languages or libraries they want (e.g. Spark, R, Python, OpenCV) without any additional infrastructure overhead.

Meet our Team

Pachyderm is a team of passionate individuals who love all things data, open source, and ML/AI. Oh, and we also love infrastructure tools and building developer communities! Don't be shy: feel free to email us with any questions at info@pachyderm.com.

The Founders

Joe Doliner
Co-Founder
Joey Zwicker
Co-Founder

Rest of the Pach

Matt Steffen
Platform Engineer
Gabriel Grant
Frontend Engineer
Bryce McAnally
Platform Engineer
Nick Harvey
Head of Marketing & Developer Advocate
Thomas Hall
Sales & Customer Success

Our Biggest Fans

Leni
Morale Officer
Hamilton
Purrrveyor of goods

Join our team and help build a new world order!

Join Us!