Talener
- 5+ years building production ML inference systems
- Python – core to everything in this role
- PyTorch (TorchScript, ONNX, FastAPI/TorchServe) and TensorFlow (SavedModel, tf.data, XLA, TFLite) – both required
- Deep hands-on experience with transformer-based models (BERT family – DistilBERT, SBERT, etc.) in production
- Inference optimization at scale – quantization, distillation, compilation, kernel/profile-level performance work
- AWS infrastructure – EC2, Batch, Lambda, SageMaker across different media workload types
- Hybrid search architecture experience – BM25 + vector search + cross-encoder reranking
- Asynchronous processing systems – reliability, caching, deduplication, observability
- Data pipeline and workflow orchestration (Airflow or similar)
- Video frameworks – FFmpeg, large-scale frame-level inference
- Must have experience in the media industry
- Must have experience working with large amounts of data, including text, images and videos
- Experience with TransNetV2 or similar video shot boundary detection
- Familiarity with HuggingFace open source LLMs
- OpenAI API or other foundation model provider experience
- Hybrid CPU/GPU environment experience at scale
{“@context”:”http://schema.org”,”@type”:”JobPosting”,”baseSalary”:null,”datePosted”:”2026-05-22″,”validThrough”:”2027-05-22″,”description”:”<div fr-original-style="" style="box-sizing: border-box; font-family: inherit; font-size: inherit;"><strong fr-original-style="" style="box-sizing: border-box; font-weight: bold; font-family: inherit; font-size: inherit;"><span fr-original-style="font-family: Tahoma, Geneva, sans-serif; font-size: 14px;" style="font-family: Tahoma, Geneva, sans-serif; font-size: 14px; box-sizing: border-box;">Title: </span></strong><span fr-original-style="font-size: 14px;" style="font-size: 14px; box-sizing: border-box;"><span fr-original-style="font-family: Tahoma,Geneva,sans-serif;" style="font-family: Tahoma, Geneva, sans-serif; box-sizing: border-box;"> ML Engineer</span></span></div><div fr-original-style="" style="box-sizing: border-box; font-family: inherit; font-size: inherit;"><span fr-original-style="font-size: 14px;" style="font-size: 14px; box-sizing: border-box;"><span fr-original-style="font-family: Tahoma,Geneva,sans-serif;" style="font-family: Tahoma, Geneva, sans-serif; box-sizing: border-box;"><br fr-original-style="" style="box-sizing: border-box; font-family: inherit; font-size: inherit;"></span></span></div><div fr-original-style="" style="box-sizing: border-box; font-family: inherit; font-size: inherit;"><span fr-original-style="font-size: 14px;" style="font-size: 14px; box-sizing: border-box;"><span fr-original-style="font-family: Tahoma,Geneva,sans-serif;" style="font-family: Tahoma, Geneva, sans-serif; box-sizing: border-box;"><strong fr-original-style="" style="box-sizing: border-box; font-weight: bold; font-family: inherit; font-size: inherit;">Location:</strong> Remote</span></span></div><div fr-original-style="" style="box-sizing: border-box; font-family: inherit; font-size: inherit;"><span fr-original-style="font-size: 14px;" style="font-size: 14px; box-sizing: border-box;"><span fr-original-style="font-family: Tahoma,Geneva,sans-serif;" style="font-family: Tahoma, Geneva, sans-serif; box-sizing: border-box;"><br fr-original-style="" style="box-sizing: border-box; font-family: inherit; font-size: inherit;"></span></span></div><div fr-original-style="" style="box-sizing: border-box; font-family: inherit; font-size: inherit;"><span fr-original-style="font-size: 14px;" style="font-size: 14px; box-sizing: border-box;"><span fr-original-style="font-family: Tahoma,Geneva,sans-serif;" style="font-family: Tahoma, Geneva, sans-serif; box-sizing: border-box;"><strong fr-original-style="" style="box-sizing: border-box; font-weight: bold; font-family: inherit; font-size: inherit;">Client:</strong> Global newswire and media organization.Their content reaches more than half the global population daily. The tech org is modern and investing heavily in ML infrastructure to support large-scale media processing across text, image, and video.</span></span></div><div fr-original-style="" style="box-sizing: border-box; font-family: inherit; font-size: inherit;"><span fr-original-style="font-size: 14px;" style="font-size: 14px; box-sizing: border-box;"><span fr-original-style="font-family: Tahoma,Geneva,sans-serif;" style="font-family: Tahoma, Geneva, sans-serif; box-sizing: border-box;"><br fr-original-style="" style="box-sizing: border-box; font-family: inherit; font-size: inherit;"></span></span></div><div fr-original-style="" style="box-sizing: border-box; font-family: inherit; font-size: inherit;"><span fr-original-style="font-size: 14px;" style="font-size: 14px; box-sizing: border-box;"><span fr-original-style="font-family: Tahoma,Geneva,sans-serif;" style="font-family: Tahoma, Geneva, sans-serif; box-sizing: border-box;"><strong fr-original-style="" style="box-sizing: border-box; font-weight: bold; font-family: inherit; font-size: inherit;">Role Description</strong></span></span></div><div fr-original-style="" style="box-sizing: border-box; font-family: inherit; font-size: inherit;"><span fr-original-style="font-size: 14px;" style="font-size: 14px; box-sizing: border-box;"><span fr-original-style="font-family: Tahoma,Geneva,sans-serif;" style="font-family: Tahoma, Geneva, sans-serif; box-sizing: border-box;">This is a senior hands-on ML engineering role focused on building and optimizing inference systems that run in production at scale. You'd be working across text, image, and video pipelines – processing millions of media assets to power news intelligence products. Think DistilBERT for NER, SBERT for embeddings, TransNetV2 for video shot detection, and external multimodal APIs for captioning.</span></span></div><div fr-original-style="" style="box-sizing: border-box; font-family: inherit; font-size: inherit;"><span fr-original-style="font-size: 14px;" style="font-size: 14px; box-sizing: border-box;"><span fr-original-style="font-family: Tahoma,Geneva,sans-serif;" style="font-family: Tahoma, Geneva, sans-serif; box-sizing: border-box;">This is not an MLOps or platform role. They need someone who can profile a transformer, rewrite its serving path for a 2-3x latency improvement, tune an HNSW index, and make smart infrastructure decisions on SageMaker instance selection to hit p95 targets at the lowest cost. If your background is primarily Terraform, Kubernetes admin, or CI/CD pipelines – this isn't the right fit.</span></span></div><div fr-original-style="" style="box-sizing: border-box; font-family: inherit; font-size: inherit;"><span fr-original-style="font-size: 14px;" style="font-size: 14px; box-sizing: border-box;"><span fr-original-style="font-family: Tahoma,Geneva,sans-serif;" style="font-family: Tahoma, Geneva, sans-serif; box-sizing: border-box;">You'll partner closely with MLOps, platform engineering, data scientists, and product teams – but ownership of model performance, inference logic, and pipeline efficiency lives here.</span></span></div><div fr-original-style="" style="box-sizing: border-box; font-family: inherit; font-size: inherit;"><span fr-original-style="font-size: 14px;" style="font-size: 14px; box-sizing: border-box;"><span fr-original-style="font-family: Tahoma,Geneva,sans-serif;" style="font-family: Tahoma, Geneva, sans-serif; box-sizing: border-box;"><br fr-original-style="" style="box-sizing: border-box; font-family: inherit; font-size: inherit;"></span></span></div><div fr-original-style="" style="box-sizing: border-box; font-family: inherit; font-size: inherit;"><span fr-original-style="font-size: 14px;" style="font-size: 14px; box-sizing: border-box;"><span fr-original-style="font-family: Tahoma,Geneva,sans-serif;" style="font-family: Tahoma, Geneva, sans-serif; box-sizing: border-box;"><strong fr-original-style="" style="box-sizing: border-box; font-weight: bold; font-family: inherit; font-size: inherit;">Required Skills</strong></span></span></div><ul fr-original-style="" style="box-sizing: border-box; font-family: inherit; font-size: inherit; margin-top: 0px; margin-bottom: 10px;"><li fr-original-style="font-family: Tahoma, Geneva, sans-serif; font-size: 14px;" style="font-family: Tahoma, Geneva, sans-serif; font-size: 14px; box-sizing: border-box;">5+ years building production ML inference systems</li><li fr-original-style="font-family: Tahoma, Geneva, sans-serif; font-size: 14px;" style="font-family: Tahoma, Geneva, sans-serif; font-size: 14px; box-sizing: border-box;">Python – core to everything in this role</li><li fr-original-style="font-family: Tahoma, Geneva, sans-serif; font-size: 14px;" style="font-family: Tahoma, Geneva, sans-serif; font-size: 14px; box-sizing: border-box;">PyTorch (TorchScript, ONNX, FastAPI/TorchServe) and TensorFlow (SavedModel, tf.data, XLA, TFLite) – both required</li><li fr-original-style="font-family: Tahoma, Geneva, sans-serif; font-size: 14px;" style="font-family: Tahoma, Geneva, sans-serif; font-size: 14px; box-sizing: border-box;">Deep hands-on experience with transformer-based models (BERT family – DistilBERT, SBERT, etc.) in production</li><li fr-original-style="font-family: Tahoma, Geneva, sans-serif; font-size: 14px;" style="font-family: Tahoma, Geneva, sans-serif; font-size: 14px; box-sizing: border-box;">Inference optimization at scale – quantization, distillation, compilation, kernel/profile-level performance work</li><li fr-original-style="font-family: Tahoma, Geneva, sans-serif; font-size: 14px;" style="font-family: Tahoma, Geneva, sans-serif; font-size: 14px; box-sizing: border-box;">AWS infrastructure – EC2, Batch, Lambda, SageMaker across different media workload types</li><li fr-original-style="font-family: Tahoma, Geneva, sans-serif; font-size: 14px;" style="font-family: Tahoma, Geneva, sans-serif; font-size: 14px; box-sizing: border-box;">Hybrid search architecture experience – BM25 + vector search + cross-encoder reranking</li><li fr-original-style="font-family: Tahoma, Geneva, sans-serif; font-size: 14px;" style="font-family: Tahoma, Geneva, sans-serif; font-size: 14px; box-sizing: border-box;">Asynchronous processing systems – reliability, caching, deduplication, observability</li><li fr-original-style="font-family: Tahoma, Geneva, sans-serif; font-size: 14px;" style="font-family: Tahoma, Geneva, sans-serif; font-size: 14px; box-sizing: border-box;">Data pipeline and workflow orchestration (Airflow or similar)</li><li fr-original-style="font-family: Tahoma, Geneva, sans-serif; font-size: 14px;" style="font-family: Tahoma, Geneva, sans-serif; font-size: 14px; box-sizing: border-box;">Video frameworks – FFmpeg, large-scale frame-level inference</li><li fr-original-style="font-family: Tahoma, Geneva, sans-serif; font-size: 14px;" style="font-family: Tahoma, Geneva, sans-serif; font-size: 14px; box-sizing: border-box;">Must have experience in the media industry</li><li fr-original-style="font-family: Tahoma, Geneva, sans-serif; font-size: 14px;" style="font-family: Tahoma, Geneva, sans-serif; font-size: 14px; box-sizing: border-box;">Must have experience working with large amounts of data, including text, images and videos</li></ul><div fr-original-style="" style="box-sizing: border-box; font-family: inherit; font-size: inherit;"><span fr-original-style="font-size: 14px;" style="font-size: 14px; box-sizing: border-box;"><span fr-original-style="font-family: Tahoma,Geneva,sans-serif;" style="font-family: Tahoma, Geneva, sans-serif; box-sizing: border-box;"><strong fr-original-style="" style="box-sizing: border-box; font-weight: bold; font-family: inherit; font-size: inherit;">Nice to Have</strong></span></span></div><ul fr-original-style="" style="box-sizing: border-box; font-family: inherit; font-size: inherit; margin-top: 0px; margin-bottom: 10px;"><li fr-original-style="font-family: Tahoma, Geneva, sans-serif; font-size: 14px;" style="font-family: Tahoma, Geneva, sans-serif; font-size: 14px; box-sizing: border-box;">Experience with TransNetV2 or similar video shot boundary detection</li><li fr-original-style="font-family: Tahoma, Geneva, sans-serif; font-size: 14px;" style="font-family: Tahoma, Geneva, sans-serif; font-size: 14px; box-sizing: border-box;">Familiarity with HuggingFace open source LLMs</li><li fr-original-style="font-family: Tahoma, Geneva, sans-serif; font-size: 14px;" style="font-family: Tahoma, Geneva, sans-serif; font-size: 14px; box-sizing: border-box;">OpenAI API or other foundation model provider experience</li><li fr-original-style="font-family: Tahoma, Geneva, sans-serif; font-size: 14px;" style="font-family: Tahoma, Geneva, sans-serif; font-size: 14px; box-sizing: border-box;">Hybrid CPU/GPU environment experience at scale</li></ul><div fr-original-style="" style="box-sizing: border-box; font-family: inherit; font-size: inherit;"><span fr-original-style="font-size: 14px;" style="font-size: 14px; box-sizing: border-box;"><span fr-original-style="font-family: Tahoma,Geneva,sans-serif;" style="font-family: Tahoma, Geneva, sans-serif; box-sizing: border-box;"><strong fr-original-style="" style="box-sizing: border-box; font-weight: bold; font-family: inherit; font-size: inherit;">Compensation</strong></span></span></div><div fr-original-style="" style="box-sizing: border-box; font-family: inherit; font-size: inherit;"><span fr-original-style="font-size: 14px;" style="font-size: 14px; box-sizing: border-box;"><span fr-original-style="font-family: Tahoma,Geneva,sans-serif;" style="font-family: Tahoma, Geneva, sans-serif; box-sizing: border-box;">Base salary up to 150,000.00 + 15% bonus target</span></span></div><div fr-original-style="" style="box-sizing: border-box; font-family: inherit; font-size: inherit;"><span fr-original-style="font-size: 14px;" style="font-size: 14px; box-sizing: border-box;"><span fr-original-style="font-family: Tahoma,Geneva,sans-serif;" style="font-family: Tahoma, Geneva, sans-serif; box-sizing: border-box;"><br fr-original-style="" style="box-sizing: border-box; font-family: inherit; font-size: inherit;"></span></span></div><div fr-original-style="" style="box-sizing: border-box; font-family: inherit; font-size: inherit;"><span fr-original-style="font-size: 14px;" style="font-size: 14px; box-sizing: border-box;"><span fr-original-style="font-family: Tahoma,Geneva,sans-serif;" style="font-family: Tahoma, Geneva, sans-serif; box-sizing: border-box;">For additional information or to apply, please contact Bethany Moulthrop at <a fr-original-style="" href="mailto:bmoulthrop@talener.com" style="box-sizing: border-box; user-select: auto; background: transparent; color: rgb(0, 112, 192); text-decoration: none; font-family: inherit; font-size: inherit; cursor: pointer;">bmoulthrop@talener.com</a></span></span></div><div fr-original-style="" style="box-sizing: border-box; font-family: inherit; font-size: inherit;"><span fr-original-style="font-size: 14px;" style="font-size: 14px; box-sizing: border-box;"><span fr-original-style="font-family: Tahoma,Geneva,sans-serif;" style="font-family: Tahoma, Geneva, sans-serif; box-sizing: border-box;"><br fr-original-style="" style="box-sizing: border-box; font-family: inherit; font-size: inherit;"></span></span></div><div fr-original-style="" style="box-sizing: border-box; font-family: inherit; font-size: inherit;"><span fr-original-style="font-family: Tahoma, Geneva, sans-serif; font-size: 14px;" style="font-family: Tahoma, Geneva, sans-serif; font-size: 14px; box-sizing: border-box;">#LI-REMOTE</span></div>”,”employmentType”:”FULL_TIME”,”hiringOrganization”:{“@type”:”Organization”,”name”:”Talener”},”jobLocation”:{“@type”:”Place”,”address”:{“@type”:”PostalAddress”,”streetAddress”:null,”addressLocality”:”Greater New York City Area”,”addressRegion”:”NY”,”postalCode”:null,”addressCountry”:null}},”title”:”ML Engineer”,”url”:”https://talener.com/jobs/?cjobid=BA89005522&rpid=1630155&postid=VY9wj6HzxqU”,”identifier”:{“@type”:”PropertyValue”,”name”:”Talener”,”value”:null}}