
Skills you'll gain: Apache Hadoop, Apache Spark, PySpark, Apache Hive, Big Data, IBM Cloud, Kubernetes, Docker (Software), Scalability, Data Processing, Development Environment, Distributed Computing, Performance Tuning, Open Source Technology, Data Transformation, Debugging
Intermediate · Course · 1 - 3 Months

University of Pittsburgh
Skills you'll gain: Apache Hadoop, Apache Spark, PySpark, Data Pipelines, Distributed Computing, Big Data, Apache Hive, Data Processing, Data Storage, Scikit Learn (Machine Learning Library), Predictive Modeling, Scalability, Data Management, File Systems, Data Science, Data Transformation, Information Technology, Data Analysis
Build toward a degree
Intermediate · Course · 1 - 4 Weeks

Skills you'll gain: Data Warehousing, Data Flow Diagrams (DFDs), Data Modeling, Data Pipelines, Ansible, Cloud Security, Diagram Design, Data Validation, Database Design, Apache Airflow, Star Schema, Snowflake Schema, Interviewing Skills, Apache Spark, PySpark, CI/CD, Docker (Software), SQL, Workflow Management, Git (Version Control System)
Intermediate · Professional Certificate · 3 - 6 Months

Skills you'll gain: NoSQL, Apache Spark, Apache Hadoop, MongoDB, Database Development, Database Systems, Databases, Database Management Systems, Database Management, Extract, Transform, Load, Database Software, Database Administration, PySpark, Apache Hive, Machine Learning Methods, Big Data, Machine Learning, Applied Machine Learning, Generative AI, Model Evaluation
Beginner · Specialization · 3 - 6 Months

Johns Hopkins University
Skills you'll gain: Apache Hadoop, Big Data, Apache Hive, Apache Spark, NoSQL, Data Infrastructure, File Systems, Data Processing, Data Management, Dataflow, Analytics, Data Science, Databases, Data Integration, SQL, Query Languages, File I/O, Data Architecture, Distributed Computing, Performance Tuning
Intermediate · Specialization · 3 - 6 Months

Pearson
Skills you'll gain: PySpark, Apache Hadoop, Apache Spark, Big Data, Apache Hive, Data Lakes, Analytics, Data Pipelines, Data Processing, Data Import/Export, Linux Commands, Linux, File Systems, Data Management, Distributed Computing, Command-Line Interface, Relational Databases, Software Installation, Java, C++ (Programming Language)
Intermediate · Specialization · 1 - 4 Weeks

Skills you'll gain: PySpark, Apache Spark, Model Evaluation, MySQL, Data Pipelines, Scala Programming, Extract, Transform, Load, Logistic Regression, Customer Analysis, Apache Hadoop, Predictive Modeling, Applied Machine Learning, Data Processing, Data Persistence, Advanced Analytics, Big Data, Apache Maven, Data Access, Apache, Python Programming
Beginner · Specialization · 1 - 3 Months

Skills you'll gain: Apache Kafka, Data Warehousing, Extract, Transform, Load, Microsoft SQL Servers, Performance Tuning, Data Pipelines, Cloud Computing Architecture, Business Intelligence, Real Time Data, Apache Hadoop, Cloud Infrastructure, Data Modeling, Database Design, Data Quality, Responsible AI, Apache Spark, SQL, Generative AI, Data Governance, Quality Management
Intermediate · Specialization · 1 - 3 Months

Skills you'll gain: Apache Kafka, Apache Hadoop, Apache Spark, Real Time Data, Scala Programming, Data Integration, Command-Line Interface, AWS Kinesis, Apache Hive, Big Data, Applied Machine Learning, Data Processing, Systems Design, System Design and Implementation, Apache Cassandra, Data Pipelines, Java, Distributed Computing, Query Languages, IntelliJ IDEA
Intermediate · Specialization · 3 - 6 Months

University of California San Diego
Skills you'll gain: Big Data, Apache Hadoop, Scalability, Data Processing, Data Science, Distributed Computing, Unstructured Data, Data Analysis, Real Time Data, Data Quality, Data Storage
Mixed · Course · 1 - 3 Months

Skills you'll gain: Apache Spark, Apache Hadoop, Data Lakes, Big Data, Linux Commands, Linux, File Systems, Data Management, Command-Line Interface, Data Processing, Software Installation, Distributed Computing, Scalability, System Configuration
Intermediate · Course · 1 - 4 Weeks

Skills you'll gain: Apache Hive, Apache Mahout, NoSQL, Apache Hadoop, Extract, Transform, Load, Big Data, Data Warehousing, Data Pipelines, Data Infrastructure, Application Deployment, Databases, SQL, Applied Machine Learning, Database Development, Performance Tuning, Data Processing, File Systems, Real Time Data, Query Languages, Database Management
Beginner · Specialization · 3 - 6 Months