Job Description
Overview of the Company
Easalytics, a subsidiary of CTComp, is a growing B2B SaaS company providing advanced analytics and data science capabilities to the fitness industry. Our expertise includes software development, data integration, data warehousing, machine learning, AI, data visualization, and statistical analysis. We are looking for someone with a software development background and recent experience in data engineering who is passionate about data and analytics. Join our small team of technologists and be an influential ground-floor member disrupting the industry!
Personal Qualities You Will Need
- Excellent documentation and communication skills
- Ability to thrive independently and in small cross-functional teams in a rapidly evolving environment
- Self-starter with attention to detail, results-oriented, and able to work with minimal guidance
- Personal initiative for continual improvement
- Desire and ability to gain business domain knowledge
Required Skills/Technology
- 2+ years of data engineering experience, preferably supporting analytics/ML/LLM applications
- 1+ years deploying/supporting LLM agents
- Proficiency in Python
- Experience with ChatGPT, Claude, or LLama models
- Designing and maintaining vector databases, preferably using Pinecone or similar tools
- Creating and maintaining database structures for machine learning and LLM models
- LLM prompt engineering and prompt tuning
- Experience with LLM toolkits like Langchain or AutoGen, including structured outputs, functions, tool calling, and Agents
- Manipulating and querying Parquet and JSON files
- Maintaining chat history using external databases, preferably DynamoDB
- Designing data models and developing database structures in Microsoft SQL Server
- Experience with Data Warehouse design and development in SQL Server
- Developing ETL solutions, complex SQL queries, stored procedures, functions, views, and triggers
- Creating automated data migration tasks such as importing and exporting data
- Strong analytical, quantitative, problem-solving, and critical thinking skills
- Knowledge of data quality best practices, validation, and troubleshooting
- Experience with source control tools like Azure DevOps and GitHub
- Excellent written and verbal communication skills, capable of presenting detailed analyses to diverse audiences
Experience With These is a Major Plus
- LangGraph and similar tools for designing LLM Graphs
- Business Intelligence applications, preferably Tableau
- Creating test cases for monitoring LLM models
- PostgreSQL, MongoDB
- Shell, Bash, and other command line languages
- Tuning training data and metadata for LLMs using open-source or commercial toolkits
- CI/CD pipelines with Github Actions or similar
- AWS services like Glue, Athena, EMR, EKS, DynamoDB
- Lambda functions
- Agile and Scrum methodologies
Company Description
Easalytics has been providing cutting-edge analytics and data science solutions to the fitness industry for over 5 years. Our team is 100% remote and values communication, teamwork, positivity, innovation, personal initiative, and continuous improvement.