Sunday, December 8, 2024

🅸🅽🆃🆁🅴🆅🅸🅴🆆 🅴🆇🅿🅴🆁🅸🅴🅽🅲🅴- 𝑻𝒆𝒏𝒂𝒏* (🆂🆃🅰🆁🆃🆄🅿)-1



✏️ There are a total of 4 rounds:

 💡 Lead Technical (Round 1) - Virtual

 💡 Manager Interview (Round 2) - In-person

 💡 Director Interview (Round 3) - In-person

 💡 HR Interview (Round 4) - In-person


Result: Reached the HR round, but no offer letter was released

Lesson Learned: Need to better prepare for HR discussions

𝐑𝐨𝐮𝐧𝐝-𝟏

----------------------------------🆂🆀🅻-------------------------------------------


Question-1:


Given the data for products over several days:

 📌 Write a SQL query to calculate the difference in quantity for each product compared to the previous day.


 📌 Write a SQL query to identify, for each product, the two days with the lowest quantity in the following data:


Product  Day  Quantity
A        1    10
A        2    6
A        3    21
A        4    9
A        5    19
B        1    12
B        2    18
B        3    3
B        4    6
B        5    23
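One possible way to approach both parts is with window functions. The sketch below runs the SQL against an in-memory SQLite database from Python so it can be tried locally. The table name `product_sales` and the column names are assumptions, and the second query reads the question as "the two lowest-quantity days per product", which may not be exactly what the interviewer intended.

```python
import sqlite3

# Rows copied from the question; table and column names are assumed for illustration.
rows = [
    ("A", 1, 10), ("A", 2, 6), ("A", 3, 21), ("A", 4, 9), ("A", 5, 19),
    ("B", 1, 12), ("B", 2, 18), ("B", 3, 3), ("B", 4, 6), ("B", 5, 23),
]

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE product_sales (product TEXT, day INTEGER, quantity INTEGER)")
conn.executemany("INSERT INTO product_sales VALUES (?, ?, ?)", rows)

# Part 1: LAG fetches the previous day's quantity within each product.
# The first day of each product has no previous row, so its difference is NULL.
diff_sql = """
SELECT product, day, quantity,
       quantity - LAG(quantity) OVER (PARTITION BY product ORDER BY day) AS diff_from_prev_day
FROM product_sales
ORDER BY product, day
"""
daily_diff = conn.execute(diff_sql).fetchall()

# Part 2: rank days by quantity within each product and keep the two lowest.
# Ties are broken by day so the result is deterministic.
lowest_sql = """
SELECT product, day, quantity
FROM (
    SELECT product, day, quantity,
           ROW_NUMBER() OVER (PARTITION BY product ORDER BY quantity, day) AS rn
    FROM product_sales
)
WHERE rn <= 2
ORDER BY product, quantity
"""
two_lowest = conn.execute(lowest_sql).fetchall()

for row in daily_diff:
    print(row)
for row in two_lowest:
    print(row)
```

`ROW_NUMBER` always returns exactly two rows per product; if tied quantities should all be kept, `DENSE_RANK` would be the usual alternative.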




 📌 Write a SQL query to return each employee's name along with their manager's name


emp_id  emp_name  manager_id
1       'Zaid'    3
2       'Rahul'   3
3       'Raman'   4
4       'Kamran'
5       'Farhan'  1
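This is usually answered with a self-join. A minimal sketch follows, again using in-memory SQLite from Python. The table name `employee` is an assumption, and Kamran's missing `manager_id` is treated as NULL. A `LEFT JOIN` keeps employees who have no manager in the output.

```python
import sqlite3

# Rows copied from the question; Kamran's blank manager_id is assumed to be NULL.
employees = [
    (1, "Zaid", 3),
    (2, "Rahul", 3),
    (3, "Raman", 4),
    (4, "Kamran", None),
    (5, "Farhan", 1),
]

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE employee (emp_id INTEGER PRIMARY KEY, emp_name TEXT, manager_id INTEGER)"
)
conn.executemany("INSERT INTO employee VALUES (?, ?, ?)", employees)

# Join the table to itself: alias e is the employee, alias m is that employee's manager.
manager_sql = """
SELECT e.emp_name AS employee_name,
       m.emp_name AS manager_name
FROM employee AS e
LEFT JOIN employee AS m
       ON e.manager_id = m.emp_id
ORDER BY e.emp_id
"""
pairs = conn.execute(manager_sql).fetchall()

for employee_name, manager_name in pairs:
    print(employee_name, manager_name)
```

With an `INNER JOIN` instead, the row for Kamran would be dropped, which is a common follow-up question.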


----------------------------------🅿🆈🆂🅿🅰🆁🅺--------------------

 📌 Create a DataFrame and define its schema in PySpark


Input: employee_name, department, salary


 📌 Write the SQL code to:

Drop the duplicates

Find the highest salary in every department
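One way to combine both steps in a single statement is sketched below. The sample rows and the table name `employee_salary` are hypothetical (the original post gives only the column names), and "duplicates" is assumed to mean fully identical rows.

```python
import sqlite3

# Hypothetical sample data: one exact duplicate row and one salary tie in IT.
salary_rows = [
    ("Asha", "Sales", 5000),
    ("Asha", "Sales", 5000),   # exact duplicate of the row above
    ("Bilal", "Sales", 7000),
    ("Chen", "IT", 9000),
    ("Dev", "IT", 9000),       # ties with Chen for the top IT salary
    ("Esha", "IT", 6500),
]

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE employee_salary (employee_name TEXT, department TEXT, salary INTEGER)"
)
conn.executemany("INSERT INTO employee_salary VALUES (?, ?, ?)", salary_rows)

# Step 1 (deduped): SELECT DISTINCT removes fully identical rows.
# Step 2 (ranked): DENSE_RANK orders salaries within each department, highest first.
top_sql = """
WITH deduped AS (
    SELECT DISTINCT employee_name, department, salary
    FROM employee_salary
),
ranked AS (
    SELECT employee_name, department, salary,
           DENSE_RANK() OVER (PARTITION BY department ORDER BY salary DESC) AS rnk
    FROM deduped
)
SELECT department, employee_name, salary
FROM ranked
WHERE rnk = 1
ORDER BY department, employee_name
"""
top_paid = conn.execute(top_sql).fetchall()

for row in top_paid:
    print(row)
```

`DENSE_RANK` keeps every employee tied for the top salary. If only the amount per department is needed, `SELECT department, MAX(salary) ... GROUP BY department` is a simpler answer.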


 📌 Write the following in both SQL and PySpark

Input: city, gender, spent_amount

Output: city, total_spend, female_spend, male_spend
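The SQL half of this question is commonly solved with conditional aggregation. The sketch below uses hypothetical sample rows, an assumed table name `spend`, and assumes gender is stored as 'F' and 'M'. None of these details come from the original post.

```python
import sqlite3

# Hypothetical sample data; gender codes 'F' and 'M' are an assumption.
spend_rows = [
    ("Pune", "F", 100),
    ("Pune", "M", 250),
    ("Pune", "F", 50),
    ("Delhi", "M", 300),
    ("Delhi", "M", 75),
]

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE spend (city TEXT, gender TEXT, spent_amount INTEGER)")
conn.executemany("INSERT INTO spend VALUES (?, ?, ?)", spend_rows)

# A CASE expression inside SUM routes each amount to the matching gender column.
# ELSE 0 makes a city with no rows for a gender show 0 instead of NULL.
pivot_sql = """
SELECT city,
       SUM(spent_amount) AS total_spend,
       SUM(CASE WHEN gender = 'F' THEN spent_amount ELSE 0 END) AS female_spend,
       SUM(CASE WHEN gender = 'M' THEN spent_amount ELSE 0 END) AS male_spend
FROM spend
GROUP BY city
ORDER BY city
"""
city_spend = conn.execute(pivot_sql).fetchall()

for row in city_spend:
    print(row)
```

In PySpark the same shape is typically written as a `groupBy("city").agg(...)` with `when(...).otherwise(0)` from `pyspark.sql.functions` wrapped in `sum`, mirroring each `CASE` expression.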



📌 What is an Index & different types of indexes?


📌 Under what conditions should we use Normalization and Denormalization?


📌 Why should we choose Snowflake over Redshift?


📌 What is the difference between SparkContext and SparkSession?


📌 What are the differences between RDD and DataFrame, and how is DataFrame fault-tolerant?


📌 Why is RDD not commonly used nowadays?



-----------------------🆁🅾🆄🅽🅳 2 🅱🆈 🅼🅰🅽🅰🅶🅴🆁---------------------

📌 Questions related to your project

📌 What are the drawbacks in your project that you think you can improve?

📌 Share a real-time scenario where you worked on PySpark job optimization.

📌 How do you estimate data migration optimization?

📌 How do you ensure data validation, data security, and data quality? Explain a use case where you implemented these practices.


-----------------------🆁🅾🆄🅽🅳 3 🅱🆈 DIRECTOR ---------------------


📌 Project Discussion

📌 Discussion on AWS Services

📌 Explain your approach to designing a complete solution

📌 Why do you want to join a startup?

📌 How will you fit into the culture, given your background in the banking industry?


-----------------------🆁🅾🆄🅽🅳 4 🅱🆈 HR ---------------------

📌 Introduction and Background Discussion

📌 How your background aligns with this role

📌 How you will transition from banking to a startup

📌 Technical and soft skills pertinent to the role

📌 Examples of how you’ve adapted to new environments and challenges

📌 How you will fit into the startup culture

📌 What drives you to learn new tools and technologies

📌 Examples of successful teamwork and collaboration

📌 Where you see yourself in the next 3-5 years

📌 Salary Discussion


#interviewExperience

#dataengineer



