NLTK, spaCy, or Hugging Face Transformers. Familiarity with time-series databases and analysis tools. Knowledge of AI model serving frameworks like TensorFlow Serving or ONNX Runtime. Experience with AI ethics and bias mitigation techniques. Familiarity with GPU acceleration and distributed computing for AI workloads. Why Join warpSpeed: Career Growth and
quantization and pruning. Experience with kernel development using CUDA or OpenCL for image processing. Hands-on experience with TensorRT, embedded hardware accelerators and the ONNX. Strong proficiency in both C++ and Python. Software development experience on embedded devices such as NVIDIA Orin. Nice to Haves: Excellent debugging skills. Experience with
understanding of Docker and containerization. (Good to have) Experience with PyTorch and Python 3, and comfortable with C++. (Good to have) Understanding of TorchScript, ONNX Runtime, TensorRT. (Good to have) Understanding of half-precision inference and int8 quantization. What we offer: Company equity % in an early-stage startup.
setting up and scaling MLOps platforms in global organizations. Technical Expertise: Strong understanding of AI/ML technologies, algorithms, and frameworks (e.g., TensorFlow, PyTorch, ONNX), as well as experience with AI/ML workload optimization and deployment. Deep expertise in data architecture and engineering principles, data modelling, ETL processes, data
practices. PREFERRED EXPERIENCE & SKILLS: Strong Computer Science fundamentals and problem-solving skills. Strong understanding of applied machine learning using current ML frameworks: PyTorch, TensorFlow, ONNX, CNTK, R, etc. Exposure to C/C++, Go, Rust a plus. Good understanding of multi-core compute hardware and device driver fundamentals. Good knowledge
models with wearable data (e.g., continuous heart rate, motion, respiration). Exposure to embedded AI or edge model deployment (e.g., TensorFlow Lite, Core ML, ONNX). Knowledge of healthcare data privacy and security (e.g., HIPAA, GDPR). Familiarity with GMLP (Good Machine Learning Practice) and clinical evaluation frameworks. The successful
optimization. Benchmark, analyze, and improve AI workload performance. Collaborate with the hardware team to guide architectural decisions. Extend support to additional frameworks (e.g., TensorFlow, ONNX). Produce developer documentation and resources. Requirements: 5+ years of experience in AI/ML software development. Deep understanding of PyTorch internals and other major
of: CUDA, OpenCL, HIP, SYCL. Knowledge of deep learning algorithms. Interested in optimising tough linear algebra equations. Knowledge of AI framework internals (PyTorch, TensorFlow, ONNX, etc.). Full details are available. Please don't hesitate to get in touch with max@platform-recruitment.com to learn more.
Have proven hands-on experience of complex analytics at scale, for example in the areas of IoT and sensor data. Understand the PMML and ONNX model portability standards. Have experience with Teradata partner's analytical products, Cloud Service providers such as AzureML and SageMaker and partner products such as Dataiku
DRW is a diversified trading firm with over 3 decades of experience bringing sophisticated technology and exceptional people together to operate in markets around the world. We value autonomy and the ability to quickly pivot to capture opportunities, so we
be instrumental in shaping the voice of the ConnexAI product used by our worldwide customer base. Key Responsibilities: You will use tools like TensorRT, ONNX, TorchServe, and Triton to optimise and scale models for real-time, production-level deployment. You will implement and maintain CI/CD pipelines and deploy … 1+ year of deploying large-scale LLMs and/or TTS systems in production. Experience using Docker and Kubernetes (desirable). Experience with vLLM, TensorRT, ONNX, TorchServe, Triton, or similar tools. Python, cloud platforms, and performance optimisation experience. Why Join Us? At ConnexAI, you will solve complex problems in a fast