About Our Organization:
- Dow Jones is a global provider of news and business information, delivering content to consumers and organizations worldwide across multiple formats, including print, digital, mobile, and live events. With over 130 years of experience, Dow Jones has built one of the world's largest news-gathering operations. It is home to leading publications and products, including the flagship Wall Street Journal (America's largest newspaper by paid circulation), Barron's, MarketWatch, Mansion Global, Financial News, Investor's Business Daily, Factiva, Dow Jones Risk & Compliance, Dow Jones Newswires, OPIS, and Chemical Market Analytics. Dow Jones is a division of News Corp (Nasdaq: NWS, NWSA; ASX: NWS, NWSLV).
About the Role:
Dow Jones is seeking a skilled Senior Machine Learning Engineer to join our AI Engineering Team. You will be responsible for designing, building, and maintaining machine learning pipelines and infrastructure, supporting both conventional and GenAI models. You will collaborate with data scientists to seamlessly integrate various machine learning models, including large language models (LLMs).
As a key team member, you will play a crucial role in operationalizing machine learning solutions to meet organizational needs and deliver tangible value. You will leverage your strong software engineering skills to develop robust, secure, and scalable production systems, utilizing your expertise in machine learning algorithms and techniques.
Responsibilities include:
- Collaborate with data scientists and engineers to integrate ML models into various AI/ML pipelines, covering pre-processing, fine-tuning, and deployment.
- Implement optimized data storage and indexing systems for NLP, utilizing advanced database technologies.
- Develop tools and frameworks for model training, tuning, and evaluation, ensuring seamless infrastructure integration.
- Partner with data scientists and engineers to integrate models into production, leveraging cloud infrastructure as needed.
- Monitor and improve model performance for accuracy and efficiency.
- Provide ongoing support, troubleshoot issues, and implement updates for ML models.
- Build and maintain data processing pipelines for high volumes of structured and unstructured data.
- Develop and maintain documentation for all ML infrastructure and support processes.
- Stay current with GenAI, NLP, ML, and information retrieval (IR) technologies, incorporating best practices and leveraging cloud infrastructure for efficiency.
Qualifications include:
- Bachelor's degree in Computer Science, Engineering, Data Science, or a related STEM field.
- At least 3 years of industry experience in a machine learning engineering, data science, or data engineering role.
- Experience with cloud-based infrastructure and services (we use GCP, but experience with other vendors is also valuable).
- Strong programming skills in Python or other high-level languages used in machine learning.
- Experience with NLP and ML frameworks/libraries such as PyTorch, HuggingFace, LangChain, spaCy, NLTK, scikit-learn, etc. (a plus).
- Experience developing and managing NLP-focused data infrastructure, including storage, indexing, and retrieval systems.
- Proficiency with vector storage, indexing, and graph database technologies for scalable cloud operations.
- Experience with containerization and orchestration technologies like Docker and Kubernetes in cloud environments.
Our Benefits include:
- Comprehensive healthcare plans for you and your family (paid by Dow Jones).
- Extra paid time off (30 days of holidays per year).
- Option to work remotely for 3 months per year or one week per quarter.
- Meal benefit with Pluxee (€171/month).
- Retirement plans with company contributions.
- Comprehensive insurance plans.
- Well-being resources, including a $200 quarterly allowance.
- Family care benefits and caregiving support.
- Subscription discounts and employee referral programs.