Data Scientist Interview Questions

Data Scientist Interview Questions

In een sollicitatiegesprek voor de functie data scientist (M/V/X) kunt u verwachten dat de werkgever vragen stelt die uw vaardigheden voor gegevensmodellering, probleemoplossing en programmeren onderzoeken. Wees voorbereid op algemene vragen die uw kennis van statistiek en data science. Stel u ook in op open vragen die uw creativiteit, sociale vaardigheden en formele opleiding in gegevensmodellering en programmeren testen.

Meest gestelde sollicitatievragen voor een data scientist (M/V/X) en hoe te antwoorden

Question 1

Vraag 1: Welke gegevensmodelleertechnieken hebben uw voorkeur en waarom?

How to answer
Zo antwoordt u: Gegevens vertalen naar begrijpelijke en bruikbare informatie, is een essentieel onderdeel van de rol van een data scientist. Met deze vraag kunnen werkgevers uw gegevensmodelleringsvaardigheden en achtergrond doorgronden. Noem de voor u preferente gegevensmodelleringstechnieken en bespreek deze, bijvoorbeeld voordelen als gebruiksgemak, flexibiliteit, etc.
Question 2

Vraag 2: Hoe zou u nepaccounts op Instagram detecteren die gebruikt worden om consumenten op te lichten?

How to answer
Zo antwoordt u: Met zulke vragen kan een werkgever uw probleemoplossend vermogen testen. Bij het beantwoorden van open vragen als deze, is het prima om zelf naar verduidelijking te vragen en een whiteboard te gebruiken om te laten zien dat u kunt programmeren en dat u diagrammen kunt maken. Deel uw gedachtegang terwijl u de stappen van het probleem behandelt.
Question 3

Vraag 3: Beschrijf omstandigheden die een lijst, tupel of set in Python vereisen.

How to answer
Zo antwoordt u: Vraagstellers gebruiken dergelijke vragen om uw kennis van de programmeertaal Python te testen. Ga voor het sollicitatiegesprek de grondbeginselen van Python, zoals lijsten, tupels en sets. U zou moeten kunnen uitleggen wanneer en hoe elke tool door data scientists wordt gebruikt.

54,327 data scientist interview questions shared by candidates

1. Given an empty BST consist of n nodes and and an array consist of n numbers. The n nodes in a BST have been already arranged in some fashion(i.e. the BST is not empty), and none of the nodes in BST are having any data, that means we have to pick the n numbers from the given array and have to fill in the given BST. We have to make sure that the structure of the BST doesn't change. That means all the left subtree and right subtree at any given node should not change at all. 2. We have a function which returns a value among {1, 0, -1}. When the function returns -1 that means we have to terminate. we have to keep on calling this function and till we get -1. this means we will get series of 1's and 0's which we have to treat like bit pattern and has to check whether the given number is divisible by 3 or not. for e.g. the function call returns the below output. 101-1=> 101 => it's a 5 which is not divisible by 3.
avatar

Computer Scientist

Interviewed at Adobe

4.1
Sep 11, 2016

1. Given an empty BST consist of n nodes and and an array consist of n numbers. The n nodes in a BST have been already arranged in some fashion(i.e. the BST is not empty), and none of the nodes in BST are having any data, that means we have to pick the n numbers from the given array and have to fill in the given BST. We have to make sure that the structure of the BST doesn't change. That means all the left subtree and right subtree at any given node should not change at all. 2. We have a function which returns a value among {1, 0, -1}. When the function returns -1 that means we have to terminate. we have to keep on calling this function and till we get -1. this means we will get series of 1's and 0's which we have to treat like bit pattern and has to check whether the given number is divisible by 3 or not. for e.g. the function call returns the below output. 101-1=> 101 => it's a 5 which is not divisible by 3.

R4: Assume the distribution of children per family is given by: # children 0 | 1 | 2 | 3 | 4 | >=5 p 0.3 | 0.25 | 0.2 | 0.15 | 0.1 | 0 Consider a random girl in the population of children. What's the probability that she has a sister?
avatar

Data Scientist

Interviewed at Google

4.4
Sep 2, 2021

R4: Assume the distribution of children per family is given by: # children 0 | 1 | 2 | 3 | 4 | >=5 p 0.3 | 0.25 | 0.2 | 0.15 | 0.1 | 0 Consider a random girl in the population of children. What's the probability that she has a sister?

SQL: there is a table of time,post id, action and content. the action can be reported and the content is spam. another table of time,post id, user - of all posts were removed manually the question: What percent of yesterday's content views were on content that has been reported for spam and removed yesterday?
avatar

Data Scientist

Interviewed at Meta

3.5
Jun 2, 2020

SQL: there is a table of time,post id, action and content. the action can be reported and the content is spam. another table of time,post id, user - of all posts were removed manually the question: What percent of yesterday's content views were on content that has been reported for spam and removed yesterday?

• What are the typical Greek symbols used in Q-Learning? • What does Alpha typically represent? • What does Gamma typically represent? • What does Epsilon typically represent? • What is Greedy-Epsilon? • How does a High Alpha versus a Low Alpha impact the model? • What is the Exploration-Exploitation Tradeoff? • What is a Decay Structure? • What is important about a Decay Structure? • How could we apply reinforcement learning to Alexa/Echo which would add functionality? • How would you implement this? • What kind of reward structure would you use? • Why would you use that reward structure? • Tell me about a time when you were not able to complete all parts of a task? • Tell me about a time you not only met expectations but exceeded them?
avatar

Applied Scientist Internship

Interviewed at Amazon

3.5
Mar 17, 2021

• What are the typical Greek symbols used in Q-Learning? • What does Alpha typically represent? • What does Gamma typically represent? • What does Epsilon typically represent? • What is Greedy-Epsilon? • How does a High Alpha versus a Low Alpha impact the model? • What is the Exploration-Exploitation Tradeoff? • What is a Decay Structure? • What is important about a Decay Structure? • How could we apply reinforcement learning to Alexa/Echo which would add functionality? • How would you implement this? • What kind of reward structure would you use? • Why would you use that reward structure? • Tell me about a time when you were not able to complete all parts of a task? • Tell me about a time you not only met expectations but exceeded them?

Viewing 81 - 90 interview questions

Glassdoor has 54,327 interview questions and reports from Data scientist interviews. Prepare for your interview. Get hired. Love your job.