1. What Is the Databricks Workspace?

The Databricks Workspace is
👉 the starting point for all work on the Databricks platform.

  • Writing code (Notebook)
  • Data processing (Spark / SQL)
  • Creating and managing clusters
  • Creating workflows (Jobs)
  • Data governance, ML, and SQL analytics

👉 Everything you do in Databricks starts in the Workspace.


2. Workspace Home Screen Overview

Top (Home) area

  • Recently used items (Recent)
  • Favorites
  • Quick-start shortcuts

Lecture tip
👉 Rarely used in the early stages
👉 For real work, the left menu is what matters


3. Left Menu (Left Navigation Bar)

Expanding/collapsing the menu

  • May be collapsed by default
  • Expands automatically on mouse hover
  • ⚙️ Keeping it expanded is recommended while lecturing

4. Workspace Menu (Most Important ⭐)

Role

👉 The place where code and files are stored

Structure

Workspace
├── Shared
└── Users
    └── <username>

Users > My home directory

  • Personal working area
  • Items you can create:
    • 📁 Folder
    • 📓 Notebook (Python / SQL / Scala)
    • 📄 File
    • 📊 Dashboard
    • 🔔 Alert

Key point
👉 “Notebooks are ultimately stored inside the Workspace”
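
The folder layout in this section can be expressed as path-building helpers. This is a minimal sketch, assuming the root is addressed as `/Workspace` (taken from the tree in this section, not checked against a live workspace); the helper names and the example user are hypothetical.

```python
from pathlib import PurePosixPath

# Assumed root, mirroring the tree in this section (Workspace > Shared / Users).
WORKSPACE_ROOT = PurePosixPath("/Workspace")


def home_dir(username: str) -> PurePosixPath:
    """Personal working area: Workspace > Users > <username>."""
    return WORKSPACE_ROOT / "Users" / username


def shared_dir() -> PurePosixPath:
    """Area visible to everyone: Workspace > Shared."""
    return WORKSPACE_ROOT / "Shared"


def notebook_path(username: str, *parts: str) -> PurePosixPath:
    """Path of a notebook (or folder) below the user's home directory."""
    return home_dir(username).joinpath(*parts)


# Hypothetical user and notebook name, for illustration only.
example = notebook_path("someone@example.com", "demo", "first_notebook")
print(example)
```

Keeping personal experiments under the home directory and team material under `Shared` follows the split the tree suggests.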


5. Repos (Source Code Management)

Features

  • GitHub / GitHub Enterprise
  • Azure DevOps
  • Bitbucket and other Git providers (integration supported)

Usage

  • Git-based development
  • Commit / Pull / Push supported
  • Essential for collaboration

Practical tip
👉 Personal study: Workspace
👉 Team/project work: Repos is a must


6. Catalog (Catalog Explorer)

Role

👉 Metadata management screen

Items shown:

  • Databases
  • Tables
  • Views
  • Functions

Topics for later study

  • Unity Catalog
  • Data permission management

7. Workflow (Jobs)

Role

👉 Managing batch jobs & pipelines

What you can do:

  • Create Jobs
  • Set dependencies between Tasks
  • Scheduling (Cron)
  • Retries / failure handling

Sub-features

  • Jobs
  • Job Runs
  • Delta Live Tables (DLT)
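
To make task dependencies, cron scheduling, and retries concrete, here is a small sketch of a job described as plain data. The field names (`tasks`, `task_key`, `depends_on`, `max_retries`, `schedule.quartz_cron_expression`) are modeled on the JSON shape of the Databricks Jobs API as an assumption and should be checked against the official reference; the job name and task names are hypothetical. The helper only computes the order in which tasks could run and does not call Databricks.

```python
from graphlib import TopologicalSorter  # standard library, Python 3.9+

# Hypothetical job definition; nothing here is sent to Databricks.
job_spec = {
    "name": "daily_sales_pipeline",
    "schedule": {
        # Quartz cron: second minute hour day-of-month month day-of-week
        "quartz_cron_expression": "0 0 2 * * ?",  # every day at 02:00
        "timezone_id": "UTC",
    },
    "tasks": [
        {"task_key": "ingest", "depends_on": [], "max_retries": 2},
        {"task_key": "transform", "depends_on": [{"task_key": "ingest"}], "max_retries": 1},
        {"task_key": "report", "depends_on": [{"task_key": "transform"}], "max_retries": 0},
    ],
}


def run_order(spec: dict) -> list[str]:
    """Return task keys in an order that respects every depends_on entry."""
    graph = {
        task["task_key"]: {dep["task_key"] for dep in task["depends_on"]}
        for task in spec["tasks"]
    }
    # static_order() raises graphlib.CycleError if the dependencies form a cycle.
    return list(TopologicalSorter(graph).static_order())


print(run_order(job_spec))  # ['ingest', 'transform', 'report']
```

Declaring dependencies per task, rather than hard-coding a sequence, is what lets the scheduler decide which tasks can start once their upstream tasks succeed.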

8. Compute (Cluster Management)

Role

👉 Managing compute resources for running Spark

Resources you can create:

  • All-purpose Cluster
  • Job Cluster
  • SQL Warehouse
  • Cluster Pool
  • Cluster Policy

Key message
👉 Clusters are created in Databricks, not in the Azure Portal
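
Because clusters are defined inside Databricks, a cluster can be thought of as a small block of settings. The sketch below is illustrative only: the keys (`cluster_name`, `spark_version`, `node_type_id`, `num_workers`, `autotermination_minutes`) are modeled on the Databricks Clusters API as an assumption, the values are placeholders, and `validate_cluster_spec` is a hypothetical local helper, not part of any Databricks library.

```python
import json

# Keys this sketch treats as mandatory (an assumption for illustration).
REQUIRED_KEYS = {"cluster_name", "spark_version", "node_type_id", "num_workers"}


def validate_cluster_spec(spec: dict) -> list[str]:
    """Return a list of problems; an empty list means the spec looks usable."""
    problems = [f"missing key: {key}" for key in sorted(REQUIRED_KEYS - set(spec))]
    workers = spec.get("num_workers")
    if workers is not None and (not isinstance(workers, int) or workers < 0):
        problems.append("num_workers must be a non-negative integer")
    return problems


cluster_spec = {
    "cluster_name": "lecture-demo",        # hypothetical name
    "spark_version": "<runtime-version>",  # placeholder, pick one offered in the UI
    "node_type_id": "<node-type>",         # placeholder, depends on the cloud provider
    "num_workers": 1,
    "autotermination_minutes": 30,         # stop the cluster when idle to limit cost
}

print(validate_cluster_spec(cluster_spec))
print(json.dumps(cluster_spec, indent=2))
```

For a lecture environment, a small worker count and an auto-termination setting keep costs predictable.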


9. Data Ingestion

Purpose

👉 Bringing external data into Databricks

Methods

  • Native Spark Connectors
  • Partner Tools (Fivetran, etc.)

Key point
👉 “Databricks is the processing platform; the ingestion tool is your choice”


10. Delta Live Tables (DLT)

Concept

  • Declarative ETL
  • Processing driven by pipeline definitions

Location

  • Under the Workflow menu

An advanced topic for later
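
The idea of declarative ETL can be illustrated without Databricks at all. The toy below is not the DLT API: the `table` decorator, the registry, and `run_pipeline` are hypothetical names invented for this sketch. Each function declares a table, its parameter names declare which upstream tables it needs, and the runner works out the execution order from those declarations.

```python
import inspect
from graphlib import TopologicalSorter  # standard library, Python 3.9+

_TABLES = {}  # table name -> function that builds the table


def table(func):
    """Register a function as a table definition (toy stand-in, not the DLT API)."""
    _TABLES[func.__name__] = func
    return func


@table
def raw_orders():
    return [{"id": 1, "amount": 120}, {"id": 2, "amount": -5}, {"id": 3, "amount": 40}]


@table
def clean_orders(raw_orders):
    # Depends on raw_orders simply by naming it as a parameter.
    return [row for row in raw_orders if row["amount"] > 0]


@table
def order_total(clean_orders):
    return sum(row["amount"] for row in clean_orders)


def run_pipeline() -> dict:
    """Build every registered table, upstream tables first."""
    graph = {name: set(inspect.signature(fn).parameters) for name, fn in _TABLES.items()}
    results = {}
    for name in TopologicalSorter(graph).static_order():
        inputs = {dep: results[dep] for dep in graph[name]}
        results[name] = _TABLES[name](**inputs)
    return results


results = run_pipeline()
print(results["order_total"])  # 160
```

The point of the declarative style is that the author states what each table is, and the framework decides when and in what order to compute it.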


11. SQL Menu

Databricks also serves as a serverless data warehouse

What you can do:

  • Create SQL Warehouses
  • SQL Editor
  • Dashboard
  • Alert
  • Query History

Features aimed at SQL-centric analysts


12. Machine Learning Menu

Provides ML-related features:

  • Experiments
  • Models
  • Feature Store
  • MLflow

The point where a data engineer can expand → ML engineer


13. Marketplace

Features

  • Purchase/subscribe to external data
  • Free/paid datasets

Underlying technology

  • Delta Sharing

14. Partner Connect

Purpose

👉 One-click integration with external solutions

Partner examples:

  • Data Ingestion
  • Visualization (Tableau, etc.)
  • Security
  • Governance
  • ML Tools

15. Top-Right Menu

Items

  • User Settings
  • Admin Settings
  • Manage Account (Admin Console)
  • Logout

Administrators use Admin Settings frequently


16. Key Takeaways for the Lecture (One Line Each)

  • Workspace = starting point for all work in Databricks
  • Code storage = Workspace
  • Execution environment = Compute
  • Automation = Workflow
  • Metadata = Catalog
  • SQL analytics = SQL menu
  • ML = Machine Learning menu

17. Recommended Lecture Flow

  1. Explain the overall structure of the Workspace UI
  2. Workspace → create a Notebook
  3. Compute → create a Cluster
  4. Run the Notebook
  5. Workflow → create a Job

Wrap-up

The Databricks Workspace is not just a UI;
👉 it is the control tower for data engineering work.