# Q2: Same as Q1, but now assume there can be duplicates. The output should not have duplicates.
# Ex:
# l1 = [1,2,3,4,5,5,5]
# l2 = [1,5,7,11,100]
# result = [1,5]
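Q1 is not shown on this page. Judging by the example, it probably asks for the elements common to two lists, so the sketch below assumes that. The function name `common_unique` is made up for illustration, and this is one possible approach rather than the expected answer.

```python
def common_unique(l1, l2):
    """Return values found in both lists, each reported once, in l1's order."""
    lookup = set(l2)  # set gives O(1) average membership tests
    seen = set()      # values already written to the result
    result = []
    for value in l1:
        if value in lookup and value not in seen:
            seen.add(value)
            result.append(value)
    return result


print(common_unique([1, 2, 3, 4, 5, 5, 5], [1, 5, 7, 11, 100]))  # [1, 5]
```

If both inputs are guaranteed to be sorted, as they are in the example, a two-pointer scan that skips repeated values would avoid the extra sets. That guarantee is not stated in the question.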
Data Engineering Interview Questions
20,956 data engineering interview questions shared by candidates
The level of difficulty is in line with what folks have mentioned above, but they have a new question set. Funny that they kept asking the same questions for more than a year before changing them.
SQL coding: 5 questions. Four tables were given, and the questions were mostly on joins, aggregate functions, and subqueries. Python coding: I think there were 5 questions in total, but I only got as far as the second one. Count the number of words in a sentence. Count the frequency of words in a list and store the results in a hash map.
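A minimal sketch of the two Python questions mentioned above, assuming words are separated by whitespace and that a plain `dict` serves as the hash map. The function names are illustrative and do not come from the interview.

```python
def count_words(sentence):
    """Count whitespace-separated words; extra spaces are ignored."""
    return len(sentence.split())


def word_frequencies(words):
    """Map each word in the list to the number of times it appears."""
    counts = {}
    for word in words:
        counts[word] = counts.get(word, 0) + 1
    return counts


print(count_words("the quick  brown fox"))      # 4
print(word_frequencies(["a", "b", "a", "c"]))   # {'a': 2, 'b': 1, 'c': 1}
```

`collections.Counter(words)` from the standard library gives the same counts in one call. The explicit loop is shown because interviewers often ask for the logic to be written by hand.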
SQL - Q1. Find the IDs of the top 5 customers, ordered by average payment per book made by the people they invited.
Python: (1) Count "char" in the string. (2) Replace "None" in an array. The code is simple, but passing the edge cases is difficult. SQL: several percentage calculation questions.
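The post does not give the exact wording of these two questions, so the sketch below rests on assumptions: `count_char` counts occurrences of a single character, and `fill_none` replaces each `None` with the most recent non-`None` value. Both names and the fill rule are hypothetical. The guards show the kind of edge cases the poster may have had in mind.

```python
def count_char(text, char):
    """Count occurrences of char in text (assumed: char is one character)."""
    if not text or not char:
        return 0  # covers None and empty inputs
    return text.count(char)


def fill_none(values):
    """Replace each None with the previous non-None value (assumed rule).

    Leading None entries have nothing before them, so they stay None.
    """
    filled = []
    last = None
    for value in values:
        if value is not None:
            last = value
        filled.append(last)
    return filled


print(count_char("banana", "a"))                  # 3
print(fill_none([None, 1, None, None, 2, None]))  # [None, 1, 1, 1, 2, 2]
```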
Find the max, min, and avg of the order by date, customer, product, and state. The coding was easy, pretty much what people have already posted here.
What percent of all products in the grocery chain's catalog are both low fat and recyclable?
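The arithmetic behind this question is the count of products with both properties divided by the total product count, times 100. Since the table schema is not given, the sketch below uses Python with made-up records. The field names `is_low_fat` and `is_recyclable` are assumptions.

```python
def percent_low_fat_and_recyclable(products):
    """Percent of products that are both low fat and recyclable."""
    if not products:
        return 0.0  # avoid dividing by zero on an empty catalog
    matching = sum(
        1 for p in products if p["is_low_fat"] and p["is_recyclable"]
    )
    return 100.0 * matching / len(products)


# Hypothetical catalog: one of four products has both properties.
catalog = [
    {"product_id": 1, "is_low_fat": True, "is_recyclable": True},
    {"product_id": 2, "is_low_fat": True, "is_recyclable": False},
    {"product_id": 3, "is_low_fat": False, "is_recyclable": True},
    {"product_id": 4, "is_low_fat": False, "is_recyclable": False},
]
print(percent_low_fat_and_recyclable(catalog))  # 25.0
```

In SQL the same idea is usually written as a conditional count over the whole table divided by `COUNT(*)`, taking care to avoid integer division.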
Write a query that returns product_family, units_sold, percentage of promoted.
How do you educate yourself?
The remaining 30 minutes were for algorithmic programming in a language of your choice. I completed 3.5 questions in 30 minutes.