Company Name | YOE (Payscale) | Question |
---|---|---|
HashedIn | 3 | From a DataFrame with (user_id, country, timestamp, spend), find the top-5 spending users in each country for the past 30 days, ordered by spend. |
mixed companies | 3 | Transformations, broadcast join, data skew, role of driver and executor, RDD v/d Dataframe, how stages got created?, shuffeling, techniques used to increase performanance, spark configurations, z-order |