Experience: 10+ years
Salary: Confidential (based on experience)
Expected Notice Period: 30 Days
Shift: (GMT+05:30) Asia/Kolkata (IST)
Opportunity Type: Remote
Placement Type: Full Time Permanent position (Payroll and Compliance managed by Nexstem)
*Note: This is a requirement for one of Uplers' client – Nexstem
What do you need for this opportunity?
Must have skills required:
- CUDA, GPU programming, Robotics, Vulkan, Python, Kernels, Linux, metaprogramming, STL, C++
About us
Nexstem is building the Human Body API — a non-invasive technology that reads and interprets brain and body signals in real time. Founded in 2020, we create biosignal-powered infrastructure enabling applications across health, performance, research, and human–computer interaction. Backed by global partners and recognized by leading platforms, we’re making the human body machine-readable and accessible to the world.
Role: Principal Architect Engineer – Systems, Compiler & OS-Level
Location
Bangalore
About the Role
As part of our mission to democratize neurotechnology, we are seeking a Principal Architect Engineer to lead the design of performance‑critical, low‑level system software that sits close to our hardware stack and powers the real‑time Human Body API. This role is ideal for someone who has deep expertise in C++ systems programming, OS/kernel fundamentals, compiler/toolchain awareness, and hardware–software integration, and who thrives at the intersection of neuroscience, embedded systems, and scalable real‑time computing.
Key Responsibilities
- System Architecture & OS‑Level Design: Architect modular, extensible system frameworks that interface directly with hardware and firmware.
- Define low‑level hardware/firmware/kernel/driver/API interfaces.
- Make critical architectural decisions balancing latency, throughput, and reliability.
- Drive compiler/toolchain optimization strategies for performance‑critical paths.
- Develop high‑performance C++14/17/20 system components with focus on low‑latency, real‑time execution, multi‑threading, lock‑free data structures, memory management, and cache efficiency.
- Implement device drivers, communication protocols, real‑time signal acquisition and streaming pipelines, and OS‑level abstractions exposing hardware capabilities to higher‑level SDKs.
- Collaborate with hardware teams on embedded interfaces, sensor data pipelines, and kernel subsystems (interrupts, I/O scheduling, low‑level tuning).
- Profile and optimize systems across CPU/GPU/accelerator architectures using tools such as gdb, perf, valgrind, eBPF.
- Implement production‑grade reliability, fault‑tolerance, recovery, and watchdog mechanisms.
- Define engineering standards, best practices, and system‑level design documents; mentor engineers in C++, systems programming, compiler‑level thinking, and performance engineering.
- Stay current with cutting‑edge work in OS internals, real‑time computing, compiler optimizations, high‑performance systems programming, and translate algorithm and neural‑signal research into efficient runtime implementations.
Education
- Master’s in Computer Science/Engineering or related field.
- Exceptional Bachelor’s candidates may be considered.
Experience
- 10+ years in system‑level C++ development, including OS‑level, kernel, or compiler‑adjacent work.
- Proven track record building complex, production‑grade low‑level systems.
Core Technical Skills
- Expert C++ (C++14/17/20): STL, templates, design patterns, metaprogramming.
- Multi‑threading & concurrency; lock‑free design and synchronization; memory models, allocators, performance tuning.
- Linux Kernel development, device drivers, system calls, interrupts; Embedded Linux or RTOS; Firmware interfaces, hardware bring‑up.
- Toolchains and compilers (LLVM, Clang, GCC); debugging and profiling (gdb, perf, valgrind, sanitizers).
- Streaming, real‑time signal processing, or high‑frequency data acquisition.
Bonus / Good to Have
- GPU programming (CUDA, OpenCL, Vulkan compute).
- Rust or Python integration with C++ pipelines.
- Experience in neurotech, robotics, IoT, or any real‑time hardware domain.
- Knowledge of DSP, numerical methods, or ML‑based signal processing.
Soft Skills
- Strong systems thinking and ability to understand large, complex architectures.
- Exceptional problem‑solving under tight latency/performance requirements.
- Cross‑functional collaboration with hardware, AI, and product teams.
- Excellent communicator who documents clearly and leads by example.
How to apply for this opportunity
- Step 1: Click ‘Apply’ and register or login on our portal.
- Step 2: Complete the Screening Form & upload updated Resume.
- Step 3: Increase your chances to get shortlisted & meet the client for the Interview.
About Uplers
Our goal is to make hiring reliable, simple, and fast. Our role will be to help all our talents find and apply for relevant contractual onsite opportunities and progress in their career. We will support any grievances or challenges you may face during the engagement. So, if you are ready for a new challenge, a great work environment, and an opportunity to take your career to the next level, don't hesitate to apply today. We are waiting for you!