Training: : Machine Learning & AI Model Optimization Training

Level

Advanced

Duration

24h / 3 days

Date

Individually arranged

Price

Individually arranged

Machine Learning & AI Model Optimization Training

The Machine Learning & AI Model Optimization course is a practical, 3-day program focused on improving the performance and efficiency of machine learning models during training and inference. Participants will learn techniques such as quantization, pruning, mixed-precision training, and tools for accelerating models, including ONNX, TensorRT, and Triton. The program combines 80% hands-on workshops with 20% theoretical introduction, providing skills to optimize machine learning models in domains such as Computer Vision, Natural Language Processing, and LLMs/SLMs, with a focus on edge device deployment and production environments.

Who is this training for?

Data scientists and machine learning engineers with project experience in ML
Specialists responsible for deploying models in resource-constrained environments
Professionals working with large language models and computer vision who want to optimize performance
ML engineers interested in advanced training and inference methods to reduce time and costs

What will you learn during this training?

Apply advanced optimization techniques for machine learning and deep neural network models
Accelerate training and inference processes while maintaining model quality
Compress models and adapt them for edge device deployment
Work with tools (ONNX, TensorRT, Triton) and distributed frameworks for handling large models
Design scalable and efficient AI solutions ready for production

Training Program

Day 1: Optimization Basics and Advanced Work Environments
- Module 1: Introduction to ML & AI Model Optimization
  - Needs and objectives of optimization across modeling stages
  - Overview of techniques: quantization, pruning, mixed-precision training
  - Data pipeline optimization – improving loading and augmentation
  - Agile model lifecycle management for optimization (benchmarking, evaluation)
- Module 2: Data Pipelines and Flow Management
  - Automating loading, preprocessing, and augmentation for efficiency
  - Practical: testing and tuning augmentation pipelines
  - Validating data quality and efficiency in model training
- Module 3: Technologies and Tools for Optimization
  - ONNX – model exchange and acceleration standard
  - Inference acceleration frameworks: TensorRT, Triton
  - Integration with popular libraries (PyTorch, TensorFlow)
  - Hands-on workshop: preparing a model for optimization
Day 2: Optimization Techniques and Practical Deployment
- Module 4: Training Process Optimization
  - Mixed-precision training – reducing resource requirements
  - Gradient accumulation and distributed training
  - Hyperparameter selection and tuning for optimization
- Module 5: Model Compression and Post-Training Adaptation
  - Quantization and pruning methods and applications
  - Performance- and memory-optimized model formats
  - Deployment of optimized models on edge devices
  - Case study: optimizing large language models and computer vision models
Day 3: Production Optimization and Scaling
- Module 6: Applying Optimization in Production Environments
  - Specifics of optimizing LLMs and CV models
  - Deploying optimized models in cloud and on-premise environments
  - Performance monitoring, diagnostics, and inference troubleshooting
  - Practical cost and efficiency analysis: case studies
  - Scaling and managing AI models at large scale
- Module 7: Best Practices and Trends in AI Optimization
  - Resource usage monitoring and automated scaling
  - Diagnostics and troubleshooting after deployment
  - Tools for automated benchmarking and post-update testing
  - Trends in large model optimization and adaptive algorithms
  - Discussion and participant knowledge exchange

Contact us

we will organize training for you tailored to your needs

Przemysław Wołosz

Key Account Manager

+48 730 830 801

przemyslaw.wolosz@infoShareAcademy.com