Certification Requirements
DataCamp's Data Engineer Associate Certification is awarded to individuals who successfully complete two timed exams (DE101 and DE201) and one practical exam.
The DE101 is a 2-hour exam that assesses your proficiency in data management theory, data management in SQL, and exploratory analysis theory. To successfully pass this exam, you should be able to:
- Interpret a database schema and explain database design concepts (such as normalization, schema design, and data storage options)
- Perform data extraction, joining and aggregation tasks
- Perform cleaning tasks to prepare data for analysis
- Assess data quality and perform validation tasks
- Use data visualization tools to demonstrate characteristics of data
- Read and analyze data visualizations that represent the relationships between features
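As a rough illustration of the SQL extraction, joining, aggregation, and validation skills listed above, here is a minimal sketch. It is not taken from the exam: the `customers` and `orders` tables, their columns, and their values are invented for the example, and it runs the SQL through Python's built-in `sqlite3` module so it is self-contained.

```python
import sqlite3

# Hypothetical tables and values, invented purely for illustration.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE customers (id INTEGER PRIMARY KEY, region TEXT);
CREATE TABLE orders (id INTEGER PRIMARY KEY, customer_id INTEGER, amount REAL);
INSERT INTO customers VALUES (1, 'North'), (2, 'South');
INSERT INTO orders VALUES (1, 1, 10.0), (2, 1, 15.0), (3, 2, 7.5), (4, 2, NULL);
""")

# Join orders to customers, then aggregate per region.
# COUNT(o.amount) and SUM(o.amount) both skip NULL amounts.
query = """
SELECT c.region, COUNT(o.amount) AS n_orders, SUM(o.amount) AS total
FROM customers AS c
JOIN orders AS o ON o.customer_id = c.id
GROUP BY c.region
ORDER BY c.region;
"""
rows = conn.execute(query).fetchall()

# A simple data quality check: how many orders have no amount recorded?
missing = conn.execute(
    "SELECT COUNT(*) FROM orders WHERE amount IS NULL"
).fetchone()[0]

print(rows)
print("orders with missing amount:", missing)
```

Counting missing values separately, rather than letting the aggregate skip them silently, is one way to make a data quality issue visible before analysis.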
The DE201 is a 2-hour exam that assesses your proficiency in data management theory, data management in Python, and programming for data engineering in Python. To successfully pass this exam, you should be able to:
- Identify different cloud tools that can be used for storing data and creating and maintaining data pipelines
- Perform standard data import, joining and aggregation tasks
- Perform cleaning tasks to prepare data for analysis
- Assess data quality and perform validation tasks
- Collect data from non-standard formats (e.g., JSON) by modifying existing code
- Use common programming constructs to write repeatable production-quality code for analysis
- Demonstrate best practices in production code, including version control, testing, and package development
- Use software engineering principles (OOP, profiling, debugging) to write efficient, modular code in Python
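To give a sense of what collecting data from JSON, validating it, and aggregating it in repeatable code might look like, here is a minimal sketch using only the Python standard library. The `RAW` data, the `sensor` and `reading` field names, and the helper functions `load_readings` and `mean_by_sensor` are all hypothetical and chosen for illustration; the actual exam tasks may look quite different.

```python
import json
from collections import defaultdict

# Hypothetical input, invented for illustration: readings arrive as strings,
# and one record has a missing value.
RAW = """
[
  {"sensor": "a", "reading": "1.5"},
  {"sensor": "b", "reading": "2.0"},
  {"sensor": "a", "reading": null},
  {"sensor": "a", "reading": "3.5"}
]
"""


def load_readings(text):
    """Parse JSON text and split records into valid rows and rejected records."""
    valid, rejected = [], []
    for record in json.loads(text):
        try:
            # Convert the reading to a float; missing keys, nulls, and
            # non-numeric strings are treated as validation failures.
            valid.append((record["sensor"], float(record["reading"])))
        except (KeyError, TypeError, ValueError):
            rejected.append(record)
    return valid, rejected


def mean_by_sensor(rows):
    """Aggregate (sensor, value) pairs into a mean value per sensor."""
    grouped = defaultdict(list)
    for sensor, value in rows:
        grouped[sensor].append(value)
    return {sensor: sum(vals) / len(vals) for sensor, vals in grouped.items()}


valid, rejected = load_readings(RAW)
means = mean_by_sensor(valid)

print(means)
print("rejected records:", len(rejected))
```

Keeping parsing, validation, and aggregation in small separate functions is one design choice that makes each step easier to test on its own.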
What to expect on the practical exam
The final step in this certification is a practical exam. The practical exam assesses your skills in data management and programming for data engineering in Python. You'll complete tasks related to a business problem and be expected to return data that meets the given requirements.
To pass the practical exam, you'll need to be able to:
- Perform standard data import, joining and aggregation tasks
- Perform cleaning tasks to prepare data for analysis
- Assess data quality and perform validation tasks
- Collect data from non-standard formats (e.g., JSON) by modifying existing code
- Use common programming constructs to write repeatable production-quality code for analysis
How long do I have to complete the practical exam?
You’ll have a total of 30 days from the time you register for certification to complete all requirements, including the timed exams and the practical exam. If you complete the timed exams right away, you’ll have the remaining time to work on the practical exam. However, the longer you take to complete the timed exams, the less time you’ll have for the practical exam. Visit the Practical Hub in the Get Started pages for additional resources.
Retaking your certification
Candidates who are unsuccessful in any component will have to wait 14 days before they can attempt the certification again. Upon retaking the certification, you will be required to complete all exams again, including any that you may have passed on a previous attempt.
If you have questions or feedback related to certification, please submit your inquiry here.