Data Engineer

DataCamp's Data Engineer Associate Certification is awarded to individuals who successfully complete the timed DE101 exam and a further practical exam.

What to expect on the timed exams

DE101 is a 2 hour exam that assesses your proficiency in data management theory, data management in SQL, and exploratory analysis theory. To successfully pass this exam, you should be able to:

 

  • Interpret a database schema and explain database design concepts (such as normalization, design, schemas, data storage options)
  • Perform data extraction, joining and aggregation tasks
  • Perform cleaning tasks to prepare data for analysis
  • Assess data quality and perform validation tasks
  • Use data visualization tools to demonstrate characteristics of data
  • Read and analyze data visualizations to represent the relationships between features

 

DE201 is a 2 hour exam that assesses your proficiency in data management theory, data management in Python, and programming for data engineering in Python. To successfully pass this exam, you should be able to:

 

  • Identify different cloud tools that can be used for storing data and creating and maintaining data pipelines
  • Perform standard data import, joining and aggregation tasks
  • Perform cleaning tasks to prepare data for analysis
  • Assess data quality and perform validation tasks
  • Collect data from non-standard formats (e.g. json) by modifying existing code
  • Use common programming constructs to write repeatable production quality code for analysis
  • Demonstrates best practices in production code including version control, testing and package development
  • Use software engineering principles (OOP, profiling, debugging) to write efficient, modular code in python

 

What to expect on the practical exam

The final step in this certification is a practical exam. The practical exam assesses your skills in data management and programming for Data Engineering in Python. You'll complete tasks related to a business problem and be expected to return data to meet the given requirements.

 

To pass the practical exam, you'll need to be able to:

 

  • Perform standard data import, joining and aggregation tasks
  • Perform cleaning tasks to prepare data for analysis
  • Assess data quality and perform validation tasks
  • Collect data from non-standard formats (e.g. json) by modifying existing code
  • Use common programming constructs to write repeatable production quality code for analysis

 

If you have questions or feedback related to certification, please submit your inquiry here.