Senior Data Engineer Interview Questions

2,621 senior data engineer interview questions shared by candidates

1. Given the sample: id, status 1, active 2, active 3, active 4, pending 5, expired 6, expired 7, expired 8, pending Pull the unique statuses that show up consecutively 3 times, e.g. from the sample, the output would be 'active', 'expired'. 2. Given the sample: employee, in_out, time A, IN, 6:00 B, IN, 7:00 A, OUT, 8:00 C, IN, 9:30 A, IN, 9:00 A, OUT, 10:00 B, OUT, 11:00 C, OUT, 10:00 Determine which employees are in the building at 10:30.
avatar

Senior Data Engineer

Interviewed at Amazon

3.5
Jan 7, 2022

1. Given the sample: id, status 1, active 2, active 3, active 4, pending 5, expired 6, expired 7, expired 8, pending Pull the unique statuses that show up consecutively 3 times, e.g. from the sample, the output would be 'active', 'expired'. 2. Given the sample: employee, in_out, time A, IN, 6:00 B, IN, 7:00 A, OUT, 8:00 C, IN, 9:30 A, IN, 9:00 A, OUT, 10:00 B, OUT, 11:00 C, OUT, 10:00 Determine which employees are in the building at 10:30.

try to get a pattern out of string or json. ask you to translate numbers from 1 to 100 to names of numbers ( for example 34 translated to thirty-four). you need to come with a solution within 10 seconds, I am able to get it done within 2 minutes. That's not fast enough.
avatar

Senior Data Engineer

Interviewed at Meta

3.5
May 8, 2018

try to get a pattern out of string or json. ask you to translate numbers from 1 to 100 to names of numbers ( for example 34 translated to thirty-four). you need to come with a solution within 10 seconds, I am able to get it done within 2 minutes. That's not fast enough.

1. Imagine a table keeping track of all the rides a driver undertakes in a given day. How do you query that table to find out the time when he did not have any rides. So the table may have a record when he picks up his first ride and then he picks another and then drops both of them. After a while he picks up a few more which again is recorded in the table. So with all that how do you find the data when he does not have any rides
avatar

Senior Data Engineer

Interviewed at Lyft

3.8
Feb 18, 2019

1. Imagine a table keeping track of all the rides a driver undertakes in a given day. How do you query that table to find out the time when he did not have any rides. So the table may have a record when he picks up his first ride and then he picks another and then drops both of them. After a while he picks up a few more which again is recorded in the table. So with all that how do you find the data when he does not have any rides

Data Model: LinkedIn data model - model for 1st degree connection Python: 1. Dedup items in the list - retaining the order of items. ==> cannot use dict/set since order will not be retained. Follow up question on this - How would you handle nested lists? (they are looking for recursion) 2. Find the number of words in a sentence /avg length of word SQL On the product-sales-customers data model that is preloaded in coderpad.io, write the following queries 1. Count of stores in OR state with area_sqft > 25000 2. avg number of Female Customers group by state 3. Customer FirstName, Last Name and count of unqiue products purchased by state followup question - Return the top customer by state based on diverse product purchased (diverse = count(distinct product_id))
avatar

Senior Data Engineering Manager

Interviewed at Meta

3.5
Aug 26, 2020

Data Model: LinkedIn data model - model for 1st degree connection Python: 1. Dedup items in the list - retaining the order of items. ==> cannot use dict/set since order will not be retained. Follow up question on this - How would you handle nested lists? (they are looking for recursion) 2. Find the number of words in a sentence /avg length of word SQL On the product-sales-customers data model that is preloaded in coderpad.io, write the following queries 1. Count of stores in OR state with area_sqft > 25000 2. avg number of Female Customers group by state 3. Customer FirstName, Last Name and count of unqiue products purchased by state followup question - Return the top customer by state based on diverse product purchased (diverse = count(distinct product_id))

Viewing 1 - 10 interview questions

See Interview Questions for Similar Jobs

Glassdoor has 2,621 interview questions and reports from Senior data engineer interviews. Prepare for your interview. Get hired. Love your job.