- SQL Topics:
- Basic SQL Queries (SELECT, WHERE, GROUP BY, ORDER BY)
- Joins (INNER, LEFT, RIGHT, FULL OUTER)
- Subqueries and Nested Queries
- Window Functions
- Common Table Expressions (CTEs)
- Indexing for Performance
- Transactions (Commit, Rollback)
- Data Normalization
- Aggregation Functions (COUNT, SUM, AVG, MAX, MIN)
- Advanced SQL (PL/SQL, Stored Procedures, Triggers)
- SQL for Data Warehousing (OLAP Queries)
Data Engineering Tools and Techniques
- Data Storage:
- SQL Databases (MySQL, PostgreSQL, SQLite)
- NoSQL Databases (MongoDB, Cassandra)
- Cloud Data Storage (AWS S3, Google Cloud Storage, Azure Blob Storage)
- ETL Tools:
- Apache Airflow (for scheduling workflows)
- Talend (for data integration)
- Apache Nifi (for real-time data ingestion)
- Big Data Processing:
- Apache Hadoop
- Apache Spark (PySpark for Python, SparkR for R)
- Dask (for parallel computing in Python)