Data Architect

CG Infinity

United States

Remote

USD 120,000 - 180,000

Full time

13 days ago

Job summary

CG Infinity Inc. is seeking a Data Architect to lead the design and implementation of scalable data pipelines. The ideal candidate has 15+ years of experience in IT and strong expertise with Apache Spark, Databricks, and data quality tools. This remote role involves collaboration with cross-functional teams to deliver clean, reliable data for analytics and reporting.

Benefits

Flexible benefits package
401k plan with employer match
Comprehensive health coverage

Qualifications

  • 15+ years of IT experience required.
  • Proven skills in Databricks and PySpark.
  • Strong experience in big data processing and data quality.

Responsibilities

  • Design and optimize large-scale data pipelines using Spark and Databricks.
  • Collaborate with data teams to ensure data delivery and quality.
  • Implement data validation and quality checks.

Skills

Data Quality
Big Data Processing
Distributed Querying
Collaboration

Tools

Databricks
Apache Spark
PySpark
Deequ
Trino

Job description

Job Title: Data Architect, Remote

Get to Know Us:
CG Infinity, Inc. is a software consulting firm founded in 1998. We tailor our solutions to the needs of each client rather than offering the same standard, run-of-the-mill solutions to everyone. We work closely with our clients throughout the entire process and offer solutions for a wide range of challenges.

Our Culture:
Our people-first approach to technology offers best-in-class service and success rates. Here are some of the main services that we offer at CG Infinity: Salesforce Implementations, Customer Experience & CRM, Application Development & Integration, Production Support & QA, and Data Analytics & AI.

Summary of Position:
We are seeking a Senior Data Engineer with hands-on experience in big data processing, data quality, and distributed querying. You will be responsible for designing and building robust, scalable data pipelines using modern tools such as Databricks, Apache Spark, PySpark, Deequ, and Trino. You’ll play a key role in enabling reliable, fast, and clean data delivery to support analytics, reporting, and data science use cases.


Key Responsibilities:
  • Design, develop, and optimize large-scale data pipelines using Apache Spark and Databricks.
  • Write scalable PySpark code to process structured and semi-structured data.
  • Use Trino to query data across various sources in a federated manner.
  • Implement data validation and quality checks using Deequ.
  • Collaborate with data scientists, analysts, and other engineers to ensure high-quality data delivery.
  • Tune Spark jobs for performance and cost efficiency in a cloud environment.
  • Contribute to building a modern data platform with a focus on automation, reliability, and scalability.


Qualifications:
  • 15+ years of IT experience
  • Data Skills needed:
    • Databricks (highly preferred)
    • PySpark

What Can We Offer You?
CG Infinity, Inc. offers an exceptionally strong benefits package that compares favorably with those offered by Fortune 500 companies. CG Infinity, Inc. has teamed with a highly regarded ASO to ensure a great choice of benefits.
CG Infinity, Inc. employees have the flexibility to select benefits based on factors such as personal preference, family situation, and financial objectives, and can add voluntary packages such as additional life insurance and FSAs.

CG Infinity, Inc. also offers an excellent 401k plan. Upon eligibility, CG Infinity, Inc. contributes an employer match of 100% of the first three percent and 50% of the fourth and fifth percent. All employees enrolled in the 401k retirement plan are 100% vested immediately.
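As a rough illustration of the match arithmetic described above, here is a minimal Python sketch. It assumes the percentages refer to the share of eligible pay that the employee contributes, which the posting does not state explicitly; the function names and the sample salary are hypothetical and used only for illustration.

```python
def employer_match_pct(employee_pct: float) -> float:
    """Return the employer match, in percentage points of pay.

    Assumption (not stated in the posting): percentages are measured
    against the employee's eligible pay.
    """
    if employee_pct < 0:
        raise ValueError("contribution percentage cannot be negative")
    # 100% match on the first three percent contributed.
    full_tier = min(employee_pct, 3.0)
    # 50% match on the fourth and fifth percent contributed.
    half_tier = min(max(employee_pct - 3.0, 0.0), 2.0)
    return full_tier + 0.5 * half_tier


def employer_match_dollars(salary: float, employee_pct: float) -> float:
    """Convert the match percentage into an annual dollar amount."""
    return salary * employer_match_pct(employee_pct) / 100.0


if __name__ == "__main__":
    # Hypothetical salary chosen only to show the calculation.
    for pct in (2.0, 4.0, 5.0, 10.0):
        print(pct, employer_match_pct(pct), employer_match_dollars(150_000, pct))
```

Under this reading, the match tops out at 4% of pay once the employee contributes 5% or more; contributing beyond 5% does not increase the employer's share.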

Applicants must be US citizens.
