InTDS ArchivebyViyaleta ApgarStrategic Data Analysis (Part 3): Diagnostic QuestionsDeep dive into the approach for answering “why” questionsOct 26, 20233Oct 26, 20233
InCodeXbyGaurav Thalpati7 ways to speed up your Data Reconciliation processHave you invested days, if not months, in reconciling your data post-migration to the cloud?May 6, 2022May 6, 2022
InTDS ArchivebyAlon AgmonData Access API over Data Lake Tables Without the ComplexityBuild a robust GraphQL API service on top of your S3 data lake files with DuckDB and Go.Sep 28, 20232Sep 28, 20232
InTDS ArchivebyMichal SzudejkoThe Might of Data LiteracyIs this the Key to Successful Use of Data and Analytics?Aug 23, 2023Aug 23, 2023
InTDS ArchivebyMichal SzudejkoHow to Talk About Data and Analysis to Non-Data PeopleA step-by-step tutorial for data professionalsSep 22, 20239Sep 22, 20239
InTDS ArchivebyMurtaza AliHow to Use the T-Test and its Non-Parametric CounterpartDo you really understand this test you likely learned in high school?Sep 17, 20231Sep 17, 20231
InTDS ArchivebyMiriam SantosHow to Generate Real-World Synthetic Data with CTGANExploring the Streamlit App introduced in ydata-syntheticApr 13, 20232Apr 13, 20232
InTDS ArchivebyMiriam SantosMissing Data Demystified: The Absolute Primer for Data ScientistsMissing data, missing mechanisms, and missing data profilingAug 29, 20231Aug 29, 20231
InTDS ArchivebyJohn LenehanSampling Techniques in Data AnalysisHow to choose the right data sampling method for your dataSep 6, 2023Sep 6, 2023
InTDS ArchivebyDavid RubioA step-by-step guide to build an Effective Data Quality Strategy from scratchHow to build an interpretable Data Quality Framework based on user expectationsAug 3, 20234Aug 3, 20234
InTDS ArchivebyMario LarcherDecoding NumPy’s Dot Product: A Brief Exploration of Dimensional WizardryClarifying once and for all the confusion over NumPy’s dot productJul 24, 2023Jul 24, 2023
InTDS ArchivebyKhuyen TranStop Hard Coding in a Data Science Project — Use Config Files InsteadAnd How to Efficiently Interact with Config Files in PythonMay 26, 202332May 26, 202332
InTDS ArchivebyXiaoxu GaoHow to Create Valuable Data TestsWhat matters is not the quantity, but the quality.Jul 3, 20231Jul 3, 20231
InTDS ArchivebyErdogan TaskesenThe Path to Success in Data Science Is About Your Ability to Learn. But What to Learn?The chances of successfully delivering data science projects are greatest when you keep learning, but it’s not always clear what to focus…Jun 29, 20237Jun 29, 20237
InTDS ArchivebyMarie LefevreData Documentation 101: Why? How? For Whom?Best practices for establishing complete and reliable data documentation within your organizationJun 13, 20232Jun 13, 20232
InTDS ArchivebySamuele MazzantiWhen You Should Prefer “Thompson Sampling” Over A/B TestsAn in-depth explanation of “Thompson Sampling”, a more efficient alternative to A/B testing for online learningJun 13, 20236Jun 13, 20236
InTDS ArchivebyTDS EditorsThinking Outside Data Science’s Many BoxesOur weekly selection of must-read Editors’ Picks and original featuresMay 11, 20231May 11, 20231
InTDS ArchivebyBex T.GitHub For The Modern Data Scientist: 7 Concepts You Can’t .gitignoreExplained with a bit of fun, wit and visualsMay 17, 20236May 17, 20236
InTowards AIbyHitesh HindujaComparative Analysis of Popular Statistical Tests: Which One to Use When?Let me begin by sharing my experience in detail. During my early years in the corporate world, my mentor imparted a piece of advice that…Apr 18, 2023Apr 18, 2023
InTDS ArchivebySam StoneData Observability for Analytics and ML teamsPrinciples, practices, and examples for ensuring high quality data flowsApr 6, 2023Apr 6, 2023