|
|
Scalable AI with Apache Spark and MLlib
NGN 450,000
|
Venue: Port Harcourt
Other Dates
| Venue | Date | Fee | |
|---|---|---|---|
| Lagos, Lagos State, Nigeria | 20 - 24 Apr, 2026 | NGN450000 | |
| Abuja, State, Nigeria | 21 - 25 Sep, 2026 | NGN450000 |
COURSE OVERVIEW
This course equips participants with the knowledge and relevant skills required to build, train, and deploy scalable
machine learning models using distributed computing. It introduces the core concepts of Spark’s architecture, explores
data preprocessing and feature engineering at scale, and covers supervised and unsupervised learning techniques
implemented with MLlib. The course emphasizes practical applications, performance optimization, and integration of
Spark with real world AI pipelines, equipping participants to handle large scale datasets and deliver efficient,
production ready machine learning solutions.
WHO SHOULD ATTEND?
This course is designed for data scientists, machine learning engineers, big data practitioners, and software developers who want to leverage Spark and MLlib to scale AI workloads. It is also valuable for technical managers and decision-
makers seeking to understand the capabilities and limitations of scalable machine learning systems for business or research applications. Prior knowledge of Python, basic machine learning concepts, and familiarity with distributed
systems will be beneficial.
COURSE OUTCOMES
Delegates will gain the skills and knowledge to:
• Know the fundamentals of distributed computing with Apache Spark.
• Build and train scalable machine learning models using Mllib.
• Apply classification, regression, clustering, and recommendation techniques.
• Optimize ML workflows and manage model persistence.
• Use Spark ML pipelines for automation and reproducibility.
• Integrate Spark with cloud platforms and data lakes.
• Troubleshoot performance issues and tune distributed systems.
KEY COURSE HIGHLIGHTS
At the end of the course, you will understand;
• Spark architecture and distributed data processing.
• Data wrangling and feature engineering at scale.
• Building ML pipelines for automation and deployment.
• Distributed model evaluation and parameter tuning.
• Integration with Hadoop, HDFS, and cloud services (e.g., AWS, Azure).
• Case studies in fraud detection, predictive analytics, and recommendations.
• Labs using Databricks or standalone Spark environments.
Course Booking
Please use the “book now” or “inquire” buttons on this page to either book your space or make further enquiries
| Port Harcourt | Feb 09 - 13 Feb, 2026 |
| Lagos, Lagos State, Nigeria | 20 - 24 Apr, 2026 |
| Abuja, State, Nigeria | 21 - 25 Sep, 2026 |
| NGN 450,000.00 | (3 Days (N450,000) 5 Days (1,200,000) 10 Days (3,200,000)) |
Emmanuel Joseph 09056761232
Related Courses
5 days, 11 - 15 May, 2026
GTC