Full Stack Engineer
Fully Remote in Spain or Poland
We are working with a leading online scheduling platform designed to simplify the process of coordinating meetings and events. Founded over 18 years ago, it helps individuals and teams avoid the "back-and-forth" of email scheduling by allowing users to propose multiple time slots and let participants vote on their availability.
Responsibilities of the role:
- Architect Production AI Systems: Design reliable, production-ready AI systems, selecting optimal tools for robust real-world performance.
- Curate Data & Feature Stores: Prepare high-quality datasets and maintain feature stores to ensure data consistency for training and inference.
- Build Scalable ML Pipelines: Develop end-to-end data and ML pipelines using Airflow and dbt for seamless ingestion, deployment, and monitoring.
- Design & Deploy Models: Prototype and train diverse neural architectures, including LLMs, with a focus on reproducibility and performance.
- Implement Advanced Retrieval (RAG): Design Graph RAG and hybrid retrieval systems, including graph construction and entity linking.
- Enable Edge Intelligence: Optimize and quantize large models for efficient on-device and edge processing.
Requirements of the role:
- Experience delivering complete AI components—from planning and modeling to deployment, monitoring, and iteration.
- Strong Python skills and deep familiarity with ML frameworks such as Scikit-Learn, TensorFlow, PyTorch, and Hugging Face. You’re comfortable designing, evaluating, and prototyping diverse model types.
- Hands-on experience with MLOps tools (e.g., MLflow, ZenML), dbt modeling, and working with cloud data warehouses or data lakes.
- Experience building and scheduling pipelines in Airflow. Familiarity with modern data stacks such as Kafka, Spark, and cloud warehouses (BigQuery, Redshift, Snowflake). Ability to define event-level tracking schemas for reliable analytics.
- Strong understanding of model behavior and evaluation. Experience developing frameworks for assessing model quality, reliability, hallucination detection, prompt regression, safety scoring, or multi-hop reasoning. Familiarity with RAG, graph-based retrieval, and prompt design.
- A focus on shipping systems that are robust, explainable, and usable by others.