premium

HTEC

AI Inference Engineer

Banja Luka

C++Linux Python

Iskustvo

Medior, Senior

Angažman

Puno radno vrijeme

Rok prijave

još 30 dana

O kompaniji

HTEC Group is a global consulting, software engineering, and digital product development company that empowers the world's best high-tech companies, disruptive startups, and global enterprises with innovative product design and sophisticated engineering services.  
HTEC Group was founded in 2008 in Belgrade, Serbia and today has its global headquarters in San Francisco. The company has consultancy, innovation, and product design offices in Silicon Valley, New York, and London, with its technological heart spread across development centers in Central and Southeast Europe. Overall, HTEC employs more than 2,000 highly skilled professionals in 29 locations in 12 countries. 
HTEC combines Silicon Valley-based design thinking with the best engineering talent to support global clients with complete digital product development, from strategy and conceptualization to digital product design and agile engineering at scale. The company possesses vast expertise across a multitude of domains, including Healthcare, Retail, Transportation and Smart Mobility, Logistics, FinTech, Green Energy, Media, and Deep Technology. 

Opis posla

Be part of the team creating the software foundation for next-generation AI compute platforms. In this role, you’ll work across the full stack — from low-level kernels and hardware-optimized operators to large-scale ML deployment frameworks — in close collaboration with compiler developers, ML scientists, and hardware specialists. This position offers the chance to contribute to state-of-the-art AI infrastructure, fine-tune software for custom hardware, and deepen your expertise in system software and machine learning.

Responsibilities (some of the following)
Design, develop, and maintain components of the deployment stack and software kernels for AI compute platforms
Optimize and implement core ML operators (e.g., GEMMs, convolutions, BLAS routines, SIMD kernels)
Translate computational graphs from ML frameworks onto the underlying hardware
Contribute to compiler infrastructure together with compiler and hardware teams
Investigate and resolve issues through system-level debugging and performance analysis
Deliver scalable software solutions under ambitious development schedules
Define and apply practices for testing, deployment, and scaling AI systems

Kvalifikacije

Bachelor’s degree in Computer Science, Engineering, Mathematics, or related discipline, with 3+ years of professional software development experience
Solid knowledge of computer architecture, system software, data structures
Strong programming skills in C/C++ or Python in Linux environments using common development tools
Hands-on experience implementing algorithms in high-level languages (C/C++/Python)
Exposure to specialized hardware (GPUs, FPGAs, DSPs, AI accelerators) and frameworks such as OpenCL or CUDA
Experience designing or working with high-performance software systems
Solid knowledge of ML fundamentals
Motivated team player with a strong sense of responsibility

Dodatne Informacije

You are a great fit if you have experience in at least one of the following areas:
Model serving frameworks (e.g., Triton Inference Server, DeepSpeed Inference, vLLM)
Deep learning frameworks (e.g., PyTorch, TensorFlow)
ML runtimes (e.g., ONNX Runtime, TVM, IREE, XLA)
Distributed collectives (e.g., Gloo, MPI)
Software testing and validation methodologies
Deploying ML workloads (LLMs, VLMs, NLP, etc.) across distributed systems
Implementation of ML operators and kernels (e.g., SIMD routines, Activation functions, Pooling layers, Quantization layers)
Hardware-aware optimizations and performance tuning
2+ years of experience developing software targeting AI hardware

Contribution to open-source projects (e.g., LLVM, PyTorch, TensorFlow, ONNX Runtime, xDSL, IREE) is a big plus.

Oglas istekao?

Slični Oglasi

Top IT poslovi u tvom inboxu

Pretplati se na Dzobs.com newsletter i jednom sedmično ti šaljemo najnovije poslove za odabranu poziciju.

HTEC

AI Inference Engineer

O kompaniji

Opis posla

Responsibilities (some of the following)

Kvalifikacije

Dodatne Informacije

You are a great fit if you have experience in at least one of the following areas:

Slični Oglasi

Senior QA Automation Engineer

Robotics Engineer – Digital Twin & Simulation

AI Performance Engineer

QA Automation Engineer

Top IT poslovi u tvom inboxu