EPAM Lead Data Engineer Interview Experience

Round 1: Online Test

Questions:

  1. Multiple choice questions: 10
  2. SQL Question:
    • Find the total number of unique students and the number of students who scored full marks, grouped by age.
  3. Python Question:
    • Determine whether the given list of length n (where n is always an even number) contains odd pairs.
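
One possible way to approach the SQL question above, shown as a minimal sketch rather than the expected answer. The post does not give a schema, so the table `scores(student_id, age, marks)`, the full-marks value of 100, and the helper name `students_by_age` are all assumptions made for illustration. The query runs through Python's built-in `sqlite3` module so it can be tried without a database server.

```python
import sqlite3

FULL_MARKS = 100  # assumption: the post does not say what "full marks" is

# Hypothetical schema: one row per (student, exam) with the student's age.
QUERY = """
SELECT age,
       COUNT(DISTINCT student_id) AS unique_students,
       COUNT(DISTINCT CASE WHEN marks = ? THEN student_id END) AS full_mark_students
FROM scores
GROUP BY age
ORDER BY age
"""

def students_by_age(rows):
    """Load (student_id, age, marks) rows into an in-memory table and run QUERY."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE scores (student_id INTEGER, age INTEGER, marks INTEGER)")
    conn.executemany("INSERT INTO scores VALUES (?, ?, ?)", rows)
    result = conn.execute(QUERY, (FULL_MARKS,)).fetchall()
    conn.close()
    return result

sample = [
    (1, 20, 100), (1, 20, 80),   # student 1 appears twice, counted once
    (2, 20, 100),
    (3, 21, 90),
    (4, 21, 100), (4, 21, 100),  # duplicate full-mark rows, counted once
]
print(students_by_age(sample))
```

`COUNT(DISTINCT CASE ...)` is used so that a student with several full-mark rows is still counted once. If the real table holds one row per student, a plain `SUM(CASE WHEN marks = 100 THEN 1 ELSE 0 END)` would give the same result.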

Candidate's Approach

No approach provided.

Interviewer's Feedback

No feedback provided.


Round 2: Technical Round

Questions:

  1. Discussed technical skills.
  2. How to submit a Spark job and the configurations used.
  3. Spark optimization techniques.
  4. Explanation of the Catalyst optimizer.
  5. Handling small files in Spark.
  6. Comparison between EMR and Glue.
  7. Discussion on Athena.
  8. Partition pruning.
  9. Predicate pushdown.
  10. Write a SQL query to find each person's total sales amount, including the name of the city where that person's sales were highest.
  11. Write the same code in PySpark.
  12. Write Python code to find the longest consecutive sequence in a given list.
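
For the last question in the list above, a minimal sketch of one common approach. It assumes that "consecutive sequence" means a run of consecutive integer values (such as 1, 2, 3, 4) that may appear anywhere in the list, not necessarily next to each other; the post does not confirm this reading. The function name `longest_consecutive` is hypothetical.

```python
def longest_consecutive(nums):
    """Return the longest run of consecutive integers found in nums, as a list.

    Uses a set so each value is visited a constant number of times,
    giving O(n) expected time. Ties are broken by the smaller start value.
    """
    values = set(nums)
    best_start, best_len = None, 0
    for v in values:
        # Only start counting from the first element of a run.
        if v - 1 in values:
            continue
        length = 1
        while v + length in values:
            length += 1
        if length > best_len or (length == best_len and v < best_start):
            best_start, best_len = v, length
    if best_start is None:
        return []
    return list(range(best_start, best_start + best_len))

print(longest_consecutive([100, 4, 200, 1, 3, 2]))
```

Sorting the list first and scanning it once is a simpler alternative at O(n log n); the set-based version is usually the one interviewers look for when they ask for linear time.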
Candidate's Approach

No approach provided.

Interviewer's Feedback

No feedback provided.