Data Science Hub
  • Data Science Hub
  • STATISTICS
    • Introduction
    • Fundamentals
      • Data Types
      • Central Tendency, Asymmetry, and Variability
      • Sampling
      • Confidence Interval
      • Hypothesis Testing
    • Distributions
      • Exponential Distribution
    • A/B Testing
      • Sample Size Calculation
      • Multiple Testing
  • Database
    • Database Fundamentals
    • Database Management Systems
    • Data Warehouse vs Data Lake
  • SQL
    • SQL Basics
      • Creating and Modifying Tables/Views
      • Data Types
      • Joins
    • SQL Rules
    • SQL Aggregate Functions
    • SQL Window Functions
    • SQL Data Manipulation
      • String Operations
      • Date/Time Operations
    • SQL Descriptive Stats
    • SQL Tips
    • SQL Performance Tuning
    • SQL Customization
    • SQL Practice
      • Designing Databases
        • Spotify Database Design
      • Most Commonly Asked
      • Mixed Queries
      • Popular Websites For SQL Practice
        • SQLZoo
          • World - BBC Tables
            • SUM and COUNT Tutorial
            • SELECT within SELECT Tutorial
            • SELECT from WORLD Tutorial
            • Select Quiz
            • BBC QUIZ
            • Nested SELECT Quiz
            • SUM and COUNT Quiz
          • Nobel Table
            • SELECT from Nobel Tutorial
            • Nobel Quiz
          • Soccer / Football Tables
            • JOIN Tutorial
            • JOIN Quiz
          • Movie / Actor / Casting Tables
            • More JOIN Operations Tutorial
            • JOIN Quiz 2
          • Teacher - Dept Tables
            • Using Null Quiz
          • Edinburgh Buses Table
            • Self join Quiz
        • HackerRank
          • SQL (Basic)
            • Select All
            • Select By ID
            • Japanese Cities' Attributes
            • Revising the Select Query I
            • Revising the Select Query II
            • Revising Aggregations - The Count Function
            • Revising Aggregations - The Sum Function
            • Revising Aggregations - Averages
            • Average Population
            • Japan Population
            • Population Density Difference
            • Population Census
            • African Cities
            • Average Population of Each Continent
            • Weather Observation Station 1
            • Weather Observation Station 2
            • Weather Observation Station 3
            • Weather Observation Station 4
            • Weather Observation Station 6
            • Weather Observation Station 7
            • Weather Observation Station 8
            • Weather Observation Station 9
            • Weather Observation Station 10
            • Weather Observation Station 11
            • Weather Observation Station 12
            • Weather Observation Station 13
            • Weather Observation Station 14
            • Weather Observation Station 15
            • Weather Observation Station 16
            • Weather Observation Station 17
            • Weather Observation Station 18
            • Weather Observation Station 19
            • Higher Than 75 Marks
            • Employee Names
            • Employee Salaries
            • The Blunder
            • Top Earners
            • Type of Triangle
            • The PADS
          • SQL (Intermediate)
            • Weather Observation Station 5
            • Weather Observation Station 20
            • New Companies
            • The Report
            • Top Competitors
            • Ollivander's Inventory
            • Challenges
            • Contest Leaderboard
            • SQL Project Planning
            • Placements
            • Symmetric Pairs
            • Binary Tree Nodes
            • Interviews
            • Occupations
          • SQL (Advanced)
            • Draw The Triangle 1
            • Draw The Triangle 2
            • Print Prime Numbers
            • 15 Days of Learning SQL
          • TABLES
            • City - Country
            • Station
            • Hackers - Submissions
            • Students
            • Employee - Employees
            • Occupations
            • Triangles
        • StrataScratch
          • Netflix
            • Oscar Nominees Table
            • Nominee Filmography Table
            • Nominee Information Table
          • Audible
            • Easy - Audible
          • Spotify
            • Worldwide Daily Song Ranking Table
            • Billboard Top 100 Year End Table
            • Daily Rankings 2017 US
          • Google
            • Easy - Google
            • Medium - Google
            • Hard - Google
        • LeetCode
          • Easy
  • Python
    • Basics
      • Variables and DataTypes
        • Lists
        • Dictionaries
      • Control Flow
      • Functions
    • Object Oriented Programming
      • Restaurant Modeler
    • Pythonic Resources
    • Projects
  • Machine Learning
    • Fundamentals
      • Supervised Learning
        • Classification Algorithms
          • k-Nearest Neighbors
            • kNN Parameters & Attributes
          • Logistic Regression
        • Classification Report
      • UnSupervised Learning
        • Clustering
          • Evaluation
      • Preprocessing
        • Scalers: Standard vs MinMax
        • Feature Selection vs Dimensionality Reduction
        • Encoding
    • Frameworks
    • Machine Learning in Advertising
    • Natural Language Processing
      • Stopwords
      • Name Entity Recognition (NER)
      • Sentiment Analysis
        • Agoda Reviews - Part I - Scraping Reviews, Detecting Languages, and Preprocessing
        • Agoda Reviews - Part II - Sentiment Analysis and WordClouds
    • Recommendation Systems
      • Spotify Recommender System - Artists
  • Geospatial Analysis
    • Geospatial Analysis Basics
    • GSA at Work
      • Web Scraping and Mapping
  • GIT
    • GIT Essentials
    • Connecting to GitHub
  • FAQ
    • Statistics
  • Cloud Computing
    • Introduction to Cloud Computing
    • Google Cloud Platform
  • Docker
    • What is Docker?
Powered by GitBook
On this page

Was this helpful?

  1. STATISTICS

A/B Testing

Last updated 1 year ago

Was this helpful?

In digital realm, an A/B test is an experiment conducted to assess the performance of different versions of an online experience, based on metrics like signup rates. This is achieved by randomly presenting each version to users and subsequently analyzing the outcomes.

This robust statistical method should be employed when we seek to make data-driven decisions for

  • web designs (e.g. conversion rates optimization)

  • user interfaces (e.g detecting the impact of releasing a new feature)

  • marketing strategies (e.g.evaluate the value of advertising)

  • or other operational changes.

Specifically, it's most insightful when we can clearly define and measure the performance indicators like

  • conversion rates

  • click-through rates

  • signup rates

  • or engagement levels.

A/B testing is ideal when you have two distinct variants to compare and sufficiently large sample sizes, which we will discuss further, to reach statistically significant conclusions. However, even though, A/B tests provide us with invaluable insights there are occasions when we shouldn't conduct A/B tests if:

  • we don't have the infrastructure

  • we don't have enough traffic

  • we lack clear hypothesis to test

  • the test subject is not impactful

  • there are ethical consequences (e.g. testing inappropriate content)

  • we have high opportunity cost (e.g cost of blocking a certain feature from a large number of users is high)

In cases where A/B testing is not option, we can:

  • Perform user experience research using focus groups and surveys to gain insights into the preferred options.

  • Scrutinize user activity logs to enhance our understanding of which version is a more suitable fit.

  • Implement the product change and subsequently conduct a retrospective analysis by examining historical data to verify if the targeted metric responds as anticipated.

When A/B testing is available to us, here are the parameters/factors we need to set/consider prior to the test:

  • Decide on the metric to test (i.e. conversion rate)

  • Determine thresholds for alpha (significance level, generally 5%) and power (i.e. 1-beta, generally 80%) depending on the value of minimal detectable effect (MDE)

  • Define sample size based on MDE, power and metric variance

  • Set experiment length (at least 2 weeks to account for dow - day of week effect or novelty effect)

  • Randomly select sample of users (or the entire population if the available dataset is small)

  • Randomly assign those users to control and treatment groups (similar groups in terms of counts, demographics, income, geolocation, new vs old users, etc)

For more detailed information on

After running the test and logging user actions for each group for the desired duration, we should then be

  • computing metrics of interest (i.e. conversion rate)

  • testing for statistically significant differences to make decision.

A/B testing specifically plays a crucial role when the cost of making uninformed changes could be high, or when fine-tuning an existing system where small improvements could lead to substantial gains over time. However, knowing when to apply or not to apply A/B testing is the first step!

sample sizes and thresholds please check

duration please check

https://guessthetest.com/calculating-sample-size-in-a-b-testing-everything-you-need-to-know/
https://guessthetest.com/how-long-should-you-run-an-a-b-test/
A/B Testing