Data Scientist

Beginner

Analyze data, build basic ML models, and create visualizations. Master Python, statistics, and SQL.

Your Progress0 / 50 questions

2 questions free per topic

Unlock all 50 questions with Pro

Upgrade to Pro

Topics

1

Statistics & Probability Fundamentals

2 free / 10 questions

  • 1
    Explain the normal distribution and why it is so important in statistics. What is the 68-95-99.7 rule?
  • 2
    Explain the difference between mean, median, and mode. When would you use each measure of central tendency?
  • What is the difference between variance and standard deviation? Why do we use standard deviation more often in practice?Pro
  • Explain the difference between independent and dependent events in probability. Give a practical example of each.Pro
  • What is hypothesis testing? Explain the null hypothesis, alternative hypothesis, and p-value in simple terms.Pro
  • Explain Type I and Type II errors in hypothesis testing. What is the relationship between these errors and the significance level?Pro
  • Explain the Central Limit Theorem and why it is considered one of the most important concepts in statistics.Pro
  • Explain Bayes'' Theorem and solve this problem: A disease affects 1 in 1000 people. A test for this disease is 99% accurate for those who have it and has a 2% false positive rate. If someone tests positive, what is the probability they actually have the disease?Pro
  • How would you design and analyze an A/B test? Explain the key considerations including sample size, test duration, and how to interpret results.Pro
  • Explain the difference between correlation and causation. How can you establish causation in data science, and what are the limitations?Pro

Unlock 8 more questions

Get full access with Pro

Upgrade
2

Python for Data Science

2 free / 10 questions

  • 1
    What is the difference between a Pandas DataFrame and a Series? When would you use each one?
  • 2
    How do you read data from a CSV file into a Pandas DataFrame? What are some important parameters you might need to specify?
  • After loading a dataset into a Pandas DataFrame, what methods would you use to explore and understand the data?Pro
  • How do you handle missing values in Pandas? Explain the different approaches and when to use each one.Pro
  • Explain the difference between loc and iloc in Pandas. Provide examples of when to use each one.Pro
  • How do you identify and handle duplicate rows in a Pandas DataFrame? What considerations should you keep in mind?Pro
  • Explain how the groupby operation works in Pandas. How would you use it to calculate aggregate statistics?Pro
  • Explain the different ways to combine DataFrames in Pandas. What is the difference between merge, join, and concat?Pro
  • Why are vectorized operations faster than loops in Pandas and NumPy? How would you refactor a loop-based solution to use vectorization?Pro
  • How do data types affect memory usage in Pandas? What strategies can you use to optimize memory consumption for large datasets?Pro

Unlock 8 more questions

Get full access with Pro

Upgrade
3

Data Visualization

2 free / 10 questions

  • 1
    What is the difference between Matplotlib and Seaborn? When would you use each library?
  • 2
    Describe the main chart types used in data visualization and when each is most appropriate.
  • What are the essential elements you should customize when creating a data visualization to make it clear and professional?Pro
  • How do you decide which type of visualization to use for a given dataset? What factors should you consider?Pro
  • How do you create multiple plots in a single figure using Matplotlib? Explain the subplot functionality and when to use it.Pro
  • How would you create a correlation heatmap using Seaborn? What insights can it reveal about your data?Pro
  • How do you handle missing data when creating visualizations? What are the best practices?Pro
  • What is data storytelling and how do you use visualizations to tell a compelling story with data?Pro
  • What are the most common mistakes in data visualization and how do you avoid them?Pro
  • Explain how to create and interpret advanced statistical visualizations like violin plots, pair plots, and joint plots in Seaborn.Pro

Unlock 8 more questions

Get full access with Pro

Upgrade
4

Exploratory Data Analysis

2 free / 10 questions

  • 1
    What is Exploratory Data Analysis and why is it an important step in the data science workflow?
  • 2
    What are the key steps you follow when performing Exploratory Data Analysis on a new dataset?
  • Explain the key summary statistics you would examine during EDA and what each one tells you about the data.Pro
  • What methods would you use to detect outliers in a dataset? Explain the IQR method and Z-score method.Pro
  • How do you approach missing values during Exploratory Data Analysis? What patterns should you look for?Pro
  • Explain the difference between univariate, bivariate, and multivariate analysis in EDA. What techniques do you use for each?Pro
  • How do you analyze and interpret the distribution of a variable during EDA? What does skewness tell you?Pro
  • How do you analyze feature relationships and correlations during EDA? What are the limitations of correlation analysis?Pro
  • How do you perform a comprehensive data quality assessment during EDA? What issues do you look for?Pro
  • How does EDA inform feature engineering and model selection for machine learning projects?Pro

Unlock 8 more questions

Get full access with Pro

Upgrade
5

Basic Machine Learning

2 free / 10 questions

  • 1
    What is the difference between supervised and unsupervised learning? Can you give examples of each?
  • 2
    Why do we split data into training and testing sets? What happens if we skip this step?
  • What is a feature in machine learning, and why are features important for model performance?Pro
  • Explain the concepts of overfitting and underfitting. How can you detect and prevent each?Pro
  • What is cross-validation, and why is it preferred over a simple train-test split in many scenarios?Pro
  • How does linear regression work, and what assumptions does it make? When would it be inappropriate to use?Pro
  • Explain how decision trees work for classification. What are their main advantages and disadvantages?Pro
  • Explain the bias-variance tradeoff. How does it relate to model complexity, and how do you find the right balance?Pro
  • Compare precision, recall, F1-score, and accuracy for classification problems. When would you prioritize one over the others?Pro
  • What are the main approaches to feature selection, and how do you decide which features to keep in a model?Pro

Unlock 8 more questions

Get full access with Pro

Upgrade

Mock Interview

Test your knowledge with an AI-powered mock interview session.

Start Mock Interview
Text
Voice (Pro)

Quick Stats

  • Total Questions50
  • Topics5
  • DifficultyBeginner
View Interview Checklist