ML / LLM Engineer
- ראש העין
- משרה קבועה
- משרה מלאה
- Design, fine-tune, and optimise conversational LLMs for real-time use at scale
- Implement prompt engineering and behaviour tuning across user contexts
- Collaborate with multi-modal engineers on synchronised audio/image/video outputs
- Lead dataset curation and build model evaluation pipelines for quality control
- Explore mechanisms for gamified interaction and multi-character dynamics
- Make architectural choices across inference frameworks and training strategies
- 5+ years of experience in production-grade Python development
- In-depth knowledge of transformer models, tokenization, sampling, and reasoning
- Expertise with vLLM, TensorRT-LLM, DeepSpeed, FSDP or equivalent tooling
- Distributed training experience across multi-GPU/multi-node setups
- Proficient in mixed-precision training, memory optimization, and profiling
- Familiarity with real-time deployment of latency-critical LLM services
- Experience contributing to open-source AI libraries (e.g. HF Transformers, Triton)
- Exposure to instruction tuning, multilingual adaptation, or NSFW safety alignment
- Background in roleplay generation, fine-grained classification, or agentic tools
Mploy