Data Engineer
Calyx Containers | Full-Time
About Calyx Containers
Calyx Containers is a B2B packaging company serving the cannabis (mainly for now), food, and consumer goods industries. We're building technology that transforms how businesses order, track, and manage custom packaging—from instant quotes to production visibility. Our platform integrates sales, manufacturing, and fulfillment into a unified customer experience called Calyx Command. Calyx Command puts our customers in the driver's seat of their packaging through a space-themed consumer-centric experience. We are pioneering the future of the packaging tech stack while making sure our customers have fun doing it.
About the Role
We're looking for a hands-on Data Engineer to own and scale our data infrastructure. You'll build the pipelines and architecture that unify data across our CRM, ERP, and production systems—enabling real-time visibility into customer behavior, order fulfillment, and manufacturing operations. This is a foundational role with significant ownership and growth potential. We’ve already built base infrastructure for you to expand upon.
What You'll Do
- Design and maintain ETL/ELT pipelines connecting CRM, ERP, MES, Sales Intelligence, and internal application databases
- Own our Data Lake architecture using a Bronze/Silver/Gold medallion pattern for raw ingestion, cleansed data, and business-ready datasets
- Build and operate our Golden Record System for customer identity resolution across external platforms
- Implement data quality gates, freshness SLAs, anomaly detection, and automated alerting
- Create datasets that power AI features (damage detection, document parsing, demand forecasting etc)
- Collaborate with product and engineering to expose clean data via REST APIs
- Establish observability dashboards for pipeline health and data lineage
Our Tech Stack
- Database: PostgreSQL (Neon-backed), Drizzle ORM
- Backend: Node.js, Express, TypeScript
- AI/ML: LLM APIs for document parsing and analytics
- Infrastructure: Replit deployments, structured logging with Pino, distributed tracing
- Data Patterns: Medallion architecture (Bronze/Silver/Gold), CDC-style sync jobs, batch and real-time pipelines
What We're Looking For
- 4+ years of experience in data engineering or a related role
- Strong SQL skills and proficiency with TypeScript or Python
- Experience with relational databases (PostgreSQL preferred) and data modeling
- Familiarity with ERP systems and CRM platforms
- Understanding of identity resolution, master data management, and data quality principles
- Comfort working across the stack—you'll touch APIs, database schemas, and monitoring
- Self-starter who thrives with autonomy in a fast-moving environment
Nice to Have
- Experience in manufacturing, packaging, or cannabis industry compliance
- Exposure to production/MES systems like LabelTraxx
- Background with event-driven architectures or real-time streaming
- Familiarity with Drizzle ORM or similar TypeScript-first database tooling
Why Join Us
- Ownership: You'll define how we collect, store, and use data across the company
- Impact: Your work directly powers customer-facing features and operational decisions
- Growth: Opportunity to build and lead a data team as we scale
- Modern stack: TypeScript end-to-end, no legacy systems to maintain
- Profit Sharing: Ability to participate in the company's profit-sharing plan