Sunday, December 8, 2024

๐Ÿ…ธ๐Ÿ…ฝ๐Ÿ†ƒ๐Ÿ†๐Ÿ…ด๐Ÿ†…๐Ÿ…ธ๐Ÿ…ด๐Ÿ†† ๐Ÿ…ด๐Ÿ†‡๐Ÿ…ฟ๐Ÿ…ด๐Ÿ†๐Ÿ…ธ๐Ÿ…ด๐Ÿ…ฝ๐Ÿ…ฒ๐Ÿ…ด- ๐‘ป๐’†๐’๐’‚๐’* (๐Ÿ†‚๐Ÿ†ƒ๐Ÿ…ฐ๐Ÿ†๐Ÿ†ƒ๐Ÿ†„๐Ÿ…ฟ)-1

 ๐Ÿšฉ๐Ÿ…ธ๐Ÿ…ฝ๐Ÿ†ƒ๐Ÿ†๐Ÿ…ด๐Ÿ†…๐Ÿ…ธ๐Ÿ…ด๐Ÿ†† ๐Ÿ…ด๐Ÿ†‡๐Ÿ…ฟ๐Ÿ…ด๐Ÿ†๐Ÿ…ธ๐Ÿ…ด๐Ÿ…ฝ๐Ÿ…ฒ๐Ÿ…ด- ๐‘ป๐’†๐’๐’‚๐’* (๐Ÿ†‚๐Ÿ†ƒ๐Ÿ…ฐ๐Ÿ†๐Ÿ†ƒ๐Ÿ†„๐Ÿ…ฟ)-1


✏️ There are a total of 4 rounds:

 ๐Ÿ’ก Lead Technical (Round 1)-Virtual

 ๐Ÿ’ก Manager Interview (Round 2)-Inperson

 ๐Ÿ’ก Director Interview (Round 3)-Inperson

 ๐Ÿ’ก HR Interview (Round 4)-Inperson


Result: Reached HR Round but No Offer Letter Released

Lesson Learned: Need to better prepare for HR discussions

๐‘๐จ๐ฎ๐ง๐-๐Ÿ

----------------------------------๐Ÿ†‚๐Ÿ†€๐Ÿ…ป-------------------------------------------


Question-1:


Given the data for products over several days:

 ๐Ÿ“Œ Write a SQL query to get Calculate the difference in quantity for each product compared to the previous day.


 ๐Ÿ“Œ Write a sql Query Identify any two days where the quantity was lower compared to other days for each product in the following data:


Product Day Quantity

A 1 10

A 2 6

A 3 21

A 4 9

A 5 19

B 1 12

B 2 18

B 3 3

B 4 6

B 5 23




 ๐Ÿ“Œ Write a SQL Query to Employee name & manager name


emp_id, emp_name, manager_id

1 'Zaid' 3

2 'Rahul' 3

3 'Raman' 4 

4 'Kamran'

5 'Farhan' 1


----------------------------------๐Ÿ…ฟ๐Ÿ†ˆ๐Ÿ†‚๐Ÿ…ฟ๐Ÿ…ฐ๐Ÿ†๐Ÿ…บ--------------------

 ๐Ÿ“Œ Create a dataframe & define the schema in Pyspark


input-employee_name, department,salary


 ๐Ÿ“Œ Question-Write the SQL Code

Drop the duplicates

Every Department Highest Salary


 ๐Ÿ“Œ Write it in both SQL & Pyspark

input - city,gender,Spent_amount

output-city,total_spend,female_spend,male_spend



๐Ÿ“Œ What is an Index & different types of indexes?


๐Ÿ“Œ Under what conditions should we use Normalization and Denormalization?


๐Ÿ“Œ Why should we choose Snowflake over Redshift?


๐Ÿ“Œ What is the difference between SparkContext and SparkSession?


๐Ÿ“Œ What are the differences between RDD and DataFrame, and how is DataFrame fault-tolerant?


๐Ÿ“Œ Why is RDD not commonly used nowadays?



-----------------------๐Ÿ†๐Ÿ…พ๐Ÿ†„๐Ÿ…ฝ๐Ÿ…ณ 2 ๐Ÿ…ฑ๐Ÿ†ˆ ๐Ÿ…ผ๐Ÿ…ฐ๐Ÿ…ฝ๐Ÿ…ฐ๐Ÿ…ถ๐Ÿ…ด๐Ÿ†---------------------

๐Ÿ“Œ Questions related to your project

๐Ÿ“Œ What are the drawbacks in your project that you think you can improve?

๐Ÿ“Œ Share a real-time scenario where you worked on PySpark job optimization.

๐Ÿ“Œ How do you estimate data migration optimization?

๐Ÿ“Œ How do you ensure data validation, data security, and data quality? Explain a use case where you implemented these practices.


-----------------------๐Ÿ†๐Ÿ…พ๐Ÿ†„๐Ÿ…ฝ๐Ÿ…ณ 3 ๐Ÿ…ฑ๐Ÿ†ˆ DIRECTOR ---------------------

-

๐Ÿ“Œ Project Discussion

๐Ÿ“Œ Discussion on AWS Services

๐Ÿ“Œ Explain your approach to designing a complete solution

๐Ÿ“Œ Why do you want to join a startup?

๐Ÿ“Œ How will you fit into the culture, given your background in the banking industry?


-----------------------๐Ÿ†๐Ÿ…พ๐Ÿ†„๐Ÿ…ฝ๐Ÿ…ณ 4 ๐Ÿ…ฑ๐Ÿ†ˆ HR ---------------------

๐Ÿ“Œ Introduction and Background Discussion

๐Ÿ“Œ How they align with this role

๐Ÿ“Œ How you will align yourself from Banking to a Startup

๐Ÿ“Œ Technical and soft skills pertinent to the role

๐Ÿ“Œ Examples of how you’ve adapted to new environments and challenges

๐Ÿ“Œ How you will fit into the startup culture

๐Ÿ“Œ What drives you to learn new tools and technologies

๐Ÿ“Œ Examples of successful teamwork and collaboration

๐Ÿ“Œ Where you see yourself in the next 3-5 years

๐Ÿ“ŒSalary Discussion


#intreviewExperience

#dataenginner




No comments:

Post a Comment

"๐Ÿš€ Delta Lake's Vectorized Delete: The Secret to 10x Faster Data Operations!"

"๐Ÿš€ Delta Lake's Vectorized Delete: The Secret to 10x Faster Data Operations!" Big news for data engineers! Delta Lake 2.0+ in...