Big Data Developer Interview Questions

1,784 big data developer interview questions shared by candidates

What is combinebykey SCD1 logic Different between edge node and data node Where the code will be deployed? (edge node or in cluster) YARN architecture What are all the versions of spark you have worked? Diff btw SchemaRDD and df Different ways to create dataframe what is bundle in oozie? fork action in oozie? distcp command how do you decide number of mappers in sqoop job? what is the optimal number of mappers provided there is no restriction in establishing connection to DB? how to do you pull clob,blob datatype in oracle to HDFS? semi join,anti-join in scala diff between logical plan and physical plan where can we see logical plan?
avatar

Big Data Engineer

Interviewed at Lowe's Home Improvement

3.5
Jul 19, 2019

What is combinebykey SCD1 logic Different between edge node and data node Where the code will be deployed? (edge node or in cluster) YARN architecture What are all the versions of spark you have worked? Diff btw SchemaRDD and df Different ways to create dataframe what is bundle in oozie? fork action in oozie? distcp command how do you decide number of mappers in sqoop job? what is the optimal number of mappers provided there is no restriction in establishing connection to DB? how to do you pull clob,blob datatype in oracle to HDFS? semi join,anti-join in scala diff between logical plan and physical plan where can we see logical plan?

Reasoning questions include scenario based given 2 statements which of following is true(difficult), picture based( weight problem), one 'for' loop program question,jumbling characters problem, one % based problem. techical include one MR program to print files output based on month and find no of sundays in each month. couple of spark and kafka questions. mostly on rdd . nit sure what it is.
avatar

Big Data Lead

Interviewed at Crédit Agricole

3.8
Mar 14, 2018

Reasoning questions include scenario based given 2 statements which of following is true(difficult), picture based( weight problem), one 'for' loop program question,jumbling characters problem, one % based problem. techical include one MR program to print files output based on month and find no of sundays in each month. couple of spark and kafka questions. mostly on rdd . nit sure what it is.

Viewing 1721 - 1730 interview questions

See Interview Questions for Similar Jobs

Glassdoor has 1,784 interview questions and reports from Big data developer interviews. Prepare for your interview. Get hired. Love your job.