1 Sheet To Get You Hired Or Promoted In 2025. How data engineers can wield the power of the “brag sheet” to achieve career-altering results. By Zach Quinn in Pipeline: Your Data Engineering Resource, Dec 26, 2024.
A Guide To Data Pipeline Testing with Python. A gentle introduction to unit testing, mocking and patching for beginners. By 💡Mike Shakhomirov in TDS Archive, Mar 9, 2024.
Structuring Your Data Science Project: A Guide to the Cookiecutter Template. In the fast-paced world of software development, efficiency and consistency are key. Cookiecutter, a tool that simplifies project setup and… By Utsav Raj, Jan 11, 2024.
Necessary Culture Change with GitOps. Don’t underestimate the Role of Culture in Successful GitOps Implementation. By Artem Lajko in ITNEXT, Feb 20, 2024.
Data Warehousing Concepts for Beginners - Data Engineers. Data warehousing is a process of collecting, storing, and managing data from different sources to support business decision-making. It… By Rahul Sounder, Jan 8, 2024.
Data Warehouse Design Patterns. How I organize everything in my new data warehouse. By 💡Mike Shakhomirov in TDS Archive, Jan 29, 2024.
Lead Data Engineer Career Guide. Knowledge and skills for successful data leadership. By 💡Mike Shakhomirov in TDS Archive, Jan 6, 2024.
Streamline Data Pipelines: How to Use WhyLogs with PySpark for Data Profiling and Validation. Learn to use whylogs with PySpark for data profiling and validation. By Sarthak Sarbahi in TDS Archive, Jan 7, 2024.
System Design Series: 0 to 100 Guide to Data Streaming Systems. Learning data streaming systems and their nuances by building a real-world recommendation system using Kafka, Cassandra, and microservices. By Sanil Khurana in TDS Archive, Dec 17, 2023.
Best Data Wrangling Functions in PySpark. Learn the most helpful functions when wrangling Big Data with PySpark. By Gustavo R Santos in TDS Archive, Dec 12, 2023.
Mastering SQL Query Optimization: My Journey from 5 Hours to Under 10 Minutes. By Susan Olapade, Nov 26, 2023.
How SQL execution orders varies across databases. Why you can’t GROUP BY ordinal positions in SQL Server but can in others. By Tobi Sam in TDS Archive, Dec 7, 2023.
Data Engineering Books. Readers Digest to Learn Data Engineering Gradually. By 💡Mike Shakhomirov in TDS Archive, Nov 12, 2023.
Demystify Data Backfilling. Let’s talk about data engineers’ nightmare. By Xiaoxu Gao in TDS Archive, Nov 20, 2023.
A Complete Guide to Effectively Scale your Data Pipelines and Products with Contract Testing and… All you need to know to start implementing contract tests with dbt. By Pablo Porto in TDS Archive, Oct 25, 2023.
Building a Batch Data Pipeline with Athena and MySQL. An End-To-End Tutorial for Beginners. By 💡Mike Shakhomirov in TDS Archive, Oct 20, 2023.
Window Functions - A must know for Data Engineers and Data Scientists. Back To Basics | SQL fundamentals for beginners. By Iffat Malik in TDS Archive, Jun 14, 2023.
From Data Lakes to Data Mesh: A Guide to the Latest Enterprise Data Architecture. Understand why large companies are embracing data mesh. By Col Jung in TDS Archive, May 30, 2023.
A Maturity Model for Data Modeling and Design. Best practices for mastering the standardization of data definitions. By Willem Koenders in TDS Archive, Apr 25, 2023.
Creating a Transparent Data Environment with Data Lineage. The benefits of column-level lineage across your stack. By Madison Schott in TDS Archive, Apr 11, 2023.