Seeking a mid-level Spark Java Developer with expertise in big data processing, Python, and Apache Spark, particularly within the finance domain. Candidate should have strong experience working with financial instruments, market risk, and large-scale distributed computing systems.
This role involves developing and optimizing data pipelines for risk calculations, trade analytics, and regulatory reporting.
Key Responsibilities
- Develop and optimize scalable Spark Java-based data pipelines for processing and analyzing large-scale financial data
- Design and implement distributed computing solutions for risk modeling, pricing, and regulatory compliance
- Ensure efficient data storage and retrieval using Big Data
- Implement best practices for Spark performance tuning including partitioning, caching, and memory management
- Maintain high code quality through testing, CI/CD pipelines, and version control (Git, Jenkins)
- Work on batch processing frameworks for Market risk analytics
Qualifications and Skills
- 5+ years of experience in software development with at least 5 years of experience in Spark Java and Big Data frameworks
- Strong proficiency in Python and Spark Java with knowledge of core Spark concepts (RDDs, Dataframes, Spark Streaming, etc.)
- Experience working in financial markets, risk management, and financial instruments
- Familiarity with market risk concepts including VaR, Greeks, scenario analysis, and stress testing
- Hands-on experience with Hadoop and Spark
- Proficiency in Git, Jenkins, and CI/CD pipelines
- Excellent problem-solving skills and strong mathematical and analytical mindset
- Ability to work in a fast-paced financial environment
Job Family Group: Technology
Job Family: Applications Development
Time Type: Full time
Citi is an equal opportunity and affirmative action employer. Qualified applicants will receive consideration without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or status as a protected veteran.