😺
wiki
  • Welcome, Internet Strangers!
  • sql
    • etl
      • Basics to Remember
      • Context to Remember
      • Naming Practices
      • ETL Steps
    • performance
      • Please No
      • Initial Explorations
      • Stored Procedures
    • select-tricks
      • Over Partition
      • Stored Procedures*
      • Creating Parameters
  • python
    • Working with Files
    • Classes in Python
    • Dictionaries
    • Working with Strings
    • Using Lambda
    • Seaborn
    • machine-learning
      • Learning Pandas
      • MatPlotLib! The Dreaded Line Graph...
      • matlab-qualgraphs-notes
      • Linear Regression Example
      • kNN Analysis in ScikitLearn
    • Neat Snippets
  • bash
    • helpful_cmd
  • math
    • Basic Definitions
    • Linear Regressions
    • Meaningful Sampling
Powered by GitBook
On this page

Was this helpful?

  1. math

Meaningful Sampling

How to Get a Representative Sample

In nearly any real-world case, information on the entire population will not be available. Instead, statisticians try to get a representative sample of the total population, so descriptive statistics of the sample to generalize to the population.

Finding the right sample size is pretty simple, and a lot of online sites like (this)[https://www.surveysystem.com/sscalc.htm] will do the calculation for you.

Some definitions to keep in mind:

  • Margin of Error is how far off the descriptive statistics of the sample might be from the population's descriptive statistics.

  • Confidence Interval is how replicatable the results will be. For example, a confidence interval of 95% means that 19 times of 20, the results are good. This can be hacked, (relevant xkcd)[https://xkcd.com/882/].

  • Population Size is (WYSIWYG)[https://en.wikipedia.org/wiki/WYSIWYG]. That said, sometimes this will need to be estimated since the true population size may not be known.

  • Likely Sample Proportion is a little tricky. This can be estimated with a pilot study, or left at 0.5 to have the most conservative estimate.

PreviousLinear Regressions

Last updated 3 years ago

Was this helpful?