DE4E: Data Engineering for Everybody#

DE4E: Data Engineering for Everybody by Pseudo-Lab.

Pseudo Lab Data Camp Donates Stars Badge Forks Badge Pull Requests Badge Issues Badge GitHub contributors License Badge Hits

Loved the project? Please visit our Website


Welcome to our DE4E repository!
We aim to give you a complete understanding of data engineering, from fundamentals to advanced concepts. Whether you’re new or experienced, our repository empowers data lovers with the knowledge and skills for success in the data-driven era. Join us on this exciting journey as we unlock the full potential of data engineering together!

DE4E: Data Engineering for Everybody

Data Engineering for Everybody

Acknowledgement πŸ™

DE4E: Data Engineering for EverybodyλŠ” κ°€μ§œμ—°κ΅¬μ†Œμ˜ DSF ν”„λ‘œκ·Έλž¨μ—μ„œ μ‹œμž‘λ˜μ—ˆμŠ΅λ‹ˆλ‹€.
μ‹œμž‘μ— μ•žμ„œ κ°μ‚¬μ˜ 말씀을 μ „ν•©λ‹ˆλ‹€.

κ°€μ§œμ—°κ΅¬μ†ŒλŠ” DataCamp의 후원을 λ°›μ•„ Donates ν”„λ‘œκ·Έλž¨μ„ μ§„ν–‰ν•˜κ³  μžˆμŠ΅λ‹ˆλ‹€. ν”„λ‘œκ·Έλž¨μ„ 톡해 ꡬ직자, λΆˆμ™„μ „ μ·¨μ—…μž, λΉ„μ˜λ¦¬ 연ꡬ κ³Όν•™μž, ν•™μƒλΆ„λ“€κ»˜ DataCampμ—μ„œ μ œκ³΅ν•˜λŠ” λ‹€μ–‘ν•œ μ½”μŠ€μ™€ νŠΈλž™μ„ μ œκ³΅ν•©λ‹ˆλ‹€. λ³Έ ν”„λ‘œμ νŠΈλŠ” DataCamp Donates ν”„λ‘œκ·Έλž¨ 쀑 ν•˜λ‚˜μΈ Data Science FellowshipμœΌλ‘œλΆ€ν„° μ‹œμž‘λ˜μ—ˆμŠ΅λ‹ˆλ‹€.

DE4EλŠ” 데이터 뢄석가, 데이터 κ³Όν•™μž, 데이터 μ—”μ§€λ‹ˆμ–΄, λ¨Έμ‹ λŸ¬λ‹ μ—”μ§€λ‹ˆμ–΄κ°€ ν•¨κ»˜ λͺ¨μ—¬ λ°μ΄ν„°μ˜, 데이터에 μ˜ν•œ, 데이터λ₯Ό μœ„ν•œ Data Engineering Repositoryλ₯Ό λ§Œλ“€μ–΄ λ‚˜κ°€κ³ μž ν•©λ‹ˆλ‹€.

Contents

Schedule

idx

Date

Subject

Presenter

Pre-Question

Tag

0

2023-03-26

Session 0. Orientation

μ΄μ˜μ „

Why should we learn Data Engineering?

#OT #Direction # Motivation

1

2023-04-02

Session 1. Introduction to Data Engineering

μ΄μ˜μ „

What is Data Engineering?

#Data Engineering #Discussion

2

2023-04-09

Session 2. Data Sources and Data Collection

μ΄λ™μš±, κΉ€μ„Έν˜„

How can we collect data from variaty sources?

#Source Data #Data Collection #Data Type #Structured Data #Unstructured Data #Batch Data #Real-time Data

3

2023-04-16

Session 3. Data Transformation and Cleaning

μ΄μ˜μ „

How can we transform data more efficiently?

#Data Processing

4

2023-04-30

Session 4. Data Storage

μ†‘μœ€ν˜Έ, 전희선

How can we store data more efficiently?

#Data Store #Database #Data Lake #Lakehouse #Object-Storage #NoSQL

5

2023-05-07

Session 5. Data Processing Frameworks

μ •κ²½λ₯œ, 이화림

How data processing framework help us?

#Hadoop Eco-system #Parallel Computing

6

2023-05-14

Session 6. Data Processing Frameworks II

κΉ€μ˜ˆμ‹ , μ΅œν•œμŠΉ

Learn about various data processing framework

#Apache Spark #Apache Kafka #Apache Storm #Apache Flink

7

2023-05-28

Session 7. Introduction to Apache Airflow

κΉ€μ„±ν›ˆ, 이희민

How can we schedule, orchestrate data processing?

#Apache Airflow #Tutorial

8

2023-06-04

Session 8. Cloud Computing and Data Engineering

이민행, μ΄μ˜μ „

What is Cloud Computing? and Why it is so important?

#Cloud Computing #Multi-Cloud #Data Engineering

9

2023-06-18

Capstone Project

이화림

Let’s dive into Data Engineering Capstone Proeject

#Capstone Project

10

2023-07-09 -

Project Management

μ΄μ˜μ „, 전희선, μ •κ²½λ₯œ, μ΄λ™μš±, κΉ€μ˜ˆμ‹ 

Build Together!

#Share #Motivation #Delighted to work together #Pseudo-Lab

Contributors πŸ˜ƒ



About us πŸ‘‹πŸΌ

κ°€μ§œμ—°κ΅¬μ†ŒλŠ” λ¨Έμ‹ λŸ¬λ‹, 데이터 μ‚¬μ΄μ–ΈμŠ€, 데이터 μ—”μ§€λ‹ˆμ–΄λ§μ„ μ€‘μ‹¬μœΌλ‘œ λͺ¨μΈ λΉ„μ˜λ¦¬λ‹¨μ²΄μž…λ‹ˆλ‹€. λˆ„κ΅¬λ‚˜ μ›ν•˜λŠ” 연ꡬλ₯Ό ν•  수 μžˆλŠ” μ‹œμž‘μ μ΄ λ˜λŠ”, μ§„μ§œλ³΄λ‹€ 더 μ§„μ§œ 같은 μ—°κ΅¬μ†Œλ₯Ό 꿈꾸고 μžˆμŠ΅λ‹ˆλ‹€. 곡유(Share), 동기뢀여(Motivation), ν•¨κ»˜ν•˜λŠ” 즐거움(Delighted to work together)λΌλŠ” ν•΅μ‹¬κ°€μΉ˜λ₯Ό μΆ”κ΅¬ν•˜λ©° μ•½ 1800μ—¬ λͺ…μ˜ 연ꡬ원뢄듀이 μ˜€λŠ˜λ„ ν•¨κ»˜ λ¨Έμ‹ λŸ¬λ‹, 데이터 μ‚¬μ΄μ–ΈμŠ€, 데이터 μ—”μ§€λ‹ˆμ–΄λ§ 뢄야에 μ„ ν•œ 영ν–₯λ ₯을 ν–‰μ‚¬ν•˜κ³  μžˆμŠ΅λ‹ˆλ‹€. 보닀 μžμ„Έν•œ λ‚΄μš©μ€ μ—¬κΈ°μ„œ μ‚΄νŽ΄λ³΄μ‹€ 수 μžˆμŠ΅λ‹ˆλ‹€.

License πŸ—ž

This project is licensed under MIT license.