Data Engineer (Web Crawling Expertise) PureLogics No Magic, Just Logic

PureLogics LLC

New York (NY)

On-site

USD 100,000 - 130,000

Full time

4 days ago

Job summary

A leading technology company in the USA seeks an AI Engineer skilled in data engineering and web scraping. The successful candidate will train AI models and create data pipelines to support chatbots and analytics. Join a team that values perfection and offers professional growth opportunities.

Benefits

Health Insurance
Provident Fund
Annual Paid Leaves
Compensation Plans
Paid Certifications & Training
Car Finance Program
Bike Finance Program
Child Education Program
Two Annual Trips
Stars Of the Month Rewards
Quarterly Meetups
Referral Bonuses
Birthday & Eid Gifts

Qualifications

  • Proficient in Python and related libraries (PyTorch, TensorFlow).
  • Experience with AI-powered scraping frameworks (Scrapy, Selenium, etc.).
  • Minimum 3 years of experience in AI and data engineering.

Responsibilities

  • Train, fine-tune, and optimize AI models.
  • Build intelligent web crawling and scraping solutions.
  • Develop and manage scalable data pipelines.

Skills

Python
PyTorch
TensorFlow
Hugging Face Transformers
AI-powered scraping tools
NLP
Data pipeline tools

Job description

We are seeking an AI Engineer with proven experience in data engineering and web crawling/scraping using AI tools. The ideal candidate should be capable of training and deploying custom AI/LLM models and building data pipelines that ingest and structure large volumes of scraped content for chatbot and analytics use.

Responsibilities:
  • Train, fine-tune, and optimize AI/LLM models for specific tasks.
  • Build intelligent web crawling and scraping solutions using AI-based tools to extract structured data from public websites.
  • Process and organize scraped data for chatbot consumption and data analysis.
  • Develop and manage scalable data pipelines and workflows.

Required Skills:
  • Proficiency in Python, PyTorch/TensorFlow, and Hugging Face Transformers.
  • Strong experience in AI-powered scraping tools or frameworks (e.g., Scrapy with NLP for content filtering, Selenium with AI logic, Diffbot, or custom ML-based scrapers).
  • Familiarity with vector databases (e.g., FAISS, Pinecone) and embedding techniques.
  • Experience with LangChain, OpenAI API, or similar LLM orchestration tools.
  • Understanding of ETL workflows and data pipeline tools (e.g., Airflow, Spark).

Nice to Have:

  • Experience with cloud platforms (AWS/GCP/Azure).
  • Prior work integrating scraped datasets into chatbots or NLP-based systems.
  • Knowledge of prompt engineering and document ingestion strategies.

Experience:
  • Minimum 3 years

About Us:

PureLogics is a full-service technology company with a presence in the USA, the UAE, and Lahore. Over the past 18+ years, we have matured from a narrowly focused five-person team to a well-established technology hub with around employees. We are a CMMI Level 2 and ISO-certified company and a highly acclaimed AWS consulting partner.

The success of our business lies mainly in building a team of A-players who work and build together, and who crave perfection in everything they produce for our elite clients. We offer opportunities to young, enthusiastic individuals who are eager to take on tough challenges under our mentorship and build toward a bright future.
