Description
We are looking for an AI Specialist Engineer to improve the performance of large language and vision models for on-device inference. You will develop and deploy AI solutions that run efficiently across diverse hardware architectures.
Responsibilities:
- Compress and optimize large language and vision models for on-device inference.
- Develop pipelines for model distillation and hardware-specific compilation.
- Benchmark performance across various NPU/GPU architectures.
Qualifications:
- Expertise in model distillation, pruning, and 4-bit/8-bit quantization techniques.
- Hands-on experience with TensorRT, ONNX Runtime, and edge deployment.
- Strong C++ and Python skills.
Company
Hyphen Connect specializes in Web3 and AI talent sourcing, offering talent on-demand, RPO, and people & culture services to help teams hire and grow in crypto, blockchain, and AI initiatives.