Basics to Remember

Before getting into the weeds of using SQL...

Before diving too far into code, it's important to remember the role SQL now plays in many ETL pipelines.

In the olden days, a pipeline was, by default, almost entirely pure SQL, building data sets from one carefully maintained local server to another, all under the watchful gaze of a local sysadmin. Since the rise of SaaS, particularly for financial and sales operations, ETL work often starts with an external product and usually pipes through, or into, other off-site products.

This shift is reflected heavily in some best-practice guides (particularly those written by the SaaS vendors themselves) but not in others. Flipping between the two can cause a bit of whiplash, so keep in mind the context each piece of documentation was written for while reading it.


Last updated 3 years ago
