Nykaa Data Analyst Interview Questions | Day 9
Company – Nykaa
Designation – Senior Data Analyst
Location – Gurgaon
Salary – 22 LPA (including 10% variable)
Level of questions – 7/10
For the Senior Data Analyst position there were 4 rounds:
Round 1 – Technical Screening (SQL heavy)
Round 2 – Project, Case Study, and SQL
Round 3 – SQL, Python, and Guesstimate/Case Study
Round 4 – Cultural fit with the Hiring manager
Below are some of the questions and related concepts asked across the complete recruitment process. The candidate had some experience in the Natural Language Processing domain, so a few questions covered that area:
- What is the use of the NVL function in Oracle?
NVL replaces a null value with another value: it returns the first argument if that is not null, and the second argument otherwise.
select NVL(null, 'Amit') from dual;
This returns Amit.
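NVL itself is Oracle-specific, so the following is only a sketch of the same idea using the standard COALESCE function in an in-memory SQLite database through Python's sqlite3 module. The choice of SQLite is an assumption made to keep the example runnable without an Oracle instance.

```python
import sqlite3

# In-memory SQLite database; no Oracle instance is assumed here.
conn = sqlite3.connect(":memory:")

# COALESCE returns its first non-null argument, which matches
# what NVL does when called with two arguments.
replaced = conn.execute("SELECT COALESCE(NULL, 'Amit')").fetchone()[0]
kept = conn.execute("SELECT COALESCE('Rahul', 'Amit')").fetchone()[0]
conn.close()

print(replaced)  # Amit
print(kept)      # Rahul
```

The second call shows the other half of the behaviour: when the first argument is not null, it is returned unchanged.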
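- What is the result of the following query?
select case when null = null then 'Amit' else 'Rahul' end as Case_check from dual;
The comparison null = null is never true: it evaluates to UNKNOWN, so the WHEN branch is skipped and the ELSE branch is taken. The answer to this query is Rahul. To test for null, use IS NULL instead.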
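The behaviour can be checked with a small sketch. It runs the comparison in an in-memory SQLite database (an assumption, chosen for convenience; `dual` is not needed there) and shows that the comparison yields NULL rather than true, so the ELSE branch is the one returned.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
row = conn.execute(
    """
    SELECT
        NULL = NULL AS equality_result,
        CASE WHEN NULL = NULL THEN 'Amit' ELSE 'Rahul' END AS case_check,
        NULL IS NULL AS is_null_result
    """
).fetchone()
conn.close()

equality_result, case_check, is_null_result = row
print(equality_result)  # None (SQL NULL), neither true nor false
print(case_check)       # Rahul
print(is_null_result)   # 1, because IS NULL is the correct null test
```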
- What is a parser?
Parsing is the first step after a SQL statement is submitted. The SQL parser checks whether the statement is valid before it is executed.
The parser has 2 functions:
1. Syntax analysis – checks that the statement follows SQL grammar
2. Semantic analysis – checks that the referenced objects, such as tables and columns, exist
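The two kinds of check can be seen from the errors a database reports. The sketch below uses an in-memory SQLite database (an assumption; the exact messages differ between database engines) and a hypothetical helper `try_query` that returns the error text instead of raising.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE emp (empno INTEGER, sal REAL)")

def try_query(sql):
    """Return 'ok', or the error message SQLite raises for the statement."""
    try:
        conn.execute(sql)
        return "ok"
    except sqlite3.Error as exc:
        return str(exc)

# Misspelled keyword: rejected by syntax analysis.
syntax_problem = try_query("SELEC sal FROM emp")
# Valid grammar but unknown table: rejected when names are resolved.
semantic_problem = try_query("SELECT sal FROM staff")
# Valid grammar and known objects: accepted.
valid = try_query("SELECT sal FROM emp")
conn.close()

print(syntax_problem)
print(semantic_problem)
print(valid)
```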
- What are lapply and sapply?
In R, lapply applies a function to each element of a list and returns the results as a list.
sapply applies a function to each element of a list and simplifies the result to a vector (or matrix) where possible.
- Guesstimate – What is the size of the market for disposable diapers in India?
1.2 B people x 60% of childbearing age = 0.72 B people
0.72 B people x 1/2 are women = 0.36 B women of childbearing age
0.36 B women x 2/3 have children = 0.24 B women with children
0.24 B women x 1.5 children each = 0.36 B children
0.36 B children x 1/10 under age 2 = 36 million children under age 2
These 36 million children are the potential user base for the market estimate.
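The chain of multiplications above can be written out as a short script. Every ratio is taken directly from the estimate; the variable names are illustrative. Like the answer, the sketch stops at the number of children under age 2, because turning that into a market size would need further assumptions (share of children using disposable diapers, diapers per day, price) that are not given here.

```python
# All inputs are the rough assumptions used in the guesstimate.
population = 1.2e9                      # people in India
childbearing_age = population * 0.60    # 60% are of childbearing age
women = childbearing_age * 0.5          # half of them are women
mothers = women * 2 / 3                 # two thirds have children
children = mothers * 1.5                # 1.5 children each
under_two = children * 0.10             # 1 in 10 is under age 2

print(f"{under_two / 1e6:.0f} million children under age 2")
```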
- Find the total salary department-wise for departments with more than 2 employees.
SELECT deptno, SUM(sal) AS totalsal
FROM emp
GROUP BY deptno
HAVING COUNT(empno) > 2;
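To see the GROUP BY and HAVING logic on data, here is a sketch that runs the query against an in-memory SQLite database. The `emp` table layout and the sample rows are assumptions made for illustration only.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE emp (empno INTEGER PRIMARY KEY, deptno INTEGER, sal INTEGER)")

# Hypothetical sample data: dept 10 has 3 employees, dept 20 has 2, dept 30 has 4.
conn.executemany(
    "INSERT INTO emp (empno, deptno, sal) VALUES (?, ?, ?)",
    [
        (1, 10, 1000), (2, 10, 2000), (3, 10, 3000),
        (4, 20, 4000), (5, 20, 5000),
        (6, 30, 1500), (7, 30, 2500), (8, 30, 3500), (9, 30, 4500),
    ],
)

rows = conn.execute(
    """
    SELECT deptno, SUM(sal) AS totalsal
    FROM emp
    GROUP BY deptno
    HAVING COUNT(empno) > 2
    ORDER BY deptno
    """
).fetchall()
conn.close()

print(rows)  # [(10, 6000), (30, 12000)]
```

Department 20 is dropped because it has only 2 employees; HAVING filters groups after aggregation, which a WHERE clause cannot do.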
- How do you retrieve the 3 minimum salaries?
SELECT DISTINCT sal
FROM emp a
WHERE 3 >= (SELECT COUNT(DISTINCT sal) FROM emp b WHERE a.sal >= b.sal);
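The correlated subquery counts, for each salary, how many distinct salaries are less than or equal to it, and keeps the salary when that count is at most 3. The sketch below checks this on an in-memory SQLite database with made-up rows (both the table layout and the data are assumptions), and compares it with a simpler ORDER BY ... LIMIT form that SQLite supports.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE emp (empno INTEGER PRIMARY KEY, sal INTEGER)")

# Hypothetical salaries, including a duplicate of the lowest value.
conn.executemany(
    "INSERT INTO emp (empno, sal) VALUES (?, ?)",
    [(1, 1000), (2, 1000), (3, 2000), (4, 3000), (5, 4000), (6, 5000)],
)

# Query from the answer, with ORDER BY added for a stable result order.
correlated = conn.execute(
    """
    SELECT DISTINCT sal
    FROM emp a
    WHERE 3 >= (SELECT COUNT(DISTINCT sal) FROM emp b WHERE a.sal >= b.sal)
    ORDER BY sal
    """
).fetchall()

# Simpler alternative where LIMIT is available.
limited = conn.execute(
    "SELECT DISTINCT sal FROM emp ORDER BY sal LIMIT 3"
).fetchall()
conn.close()

print(correlated)  # [(1000,), (2000,), (3000,)]
print(limited)     # [(1000,), (2000,), (3000,)]
```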
- Case Study 1 – A client has a Diwali-themed e-commerce shop that sells five items. What are some potential problems you foresee with their revenue streams?
a. The immediate issue is that revenue will take a severe hit once the holiday season is over.
b. How to generate revenue outside the holiday season is a key point to address with the client.
c. The other concern is that the shop offers only five items.
d. Such a small catalogue severely limits the opportunity to generate revenue.
e. With so few items, a couple of bad reviews could create a lot of problems.
f. The products are mostly lighting and crackers, which have a short shelf life and a higher-than-usual defect rate.
g. Competition – since these are themed products released once a year, a competitor might offer a sub-standard product at a lower cost to kill the competition.
- How do you remove your own list of stop words from the line of text given below?
'Book My Show is the best website to book a show'
def stopy(text):
    # Custom stop-word list; matching is case-sensitive
    stop_words = ["is", "the", "and", "are", "you", "to", "here", "this", "we", "This", "a", "best"]
    words = text.split()
    no_noise = [word for word in words if word not in stop_words]
    return " ".join(no_noise)

x = stopy("Book My Show is the best website to book a show")
print(x)  # Book My Show website book show
- What are the steps involved in a typical Text-Analytics project?
We mostly follow the steps below:
  - Get the raw data
  - Convert the text into tokens, then remove special characters and punctuation
  - Remove stop words, the common words that carry little meaning
  - Apply stemming and lemmatization to reduce words to their base form
  - Compute TF-IDF to find the important words
  - Generate n-grams to see which words occur together
  - After this point, it mostly depends on the requirements of the project. We have used different algorithms at different points in time, for example:
    * Part of Speech Tagging
    * Named Entity Recognition
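The first few steps can be sketched with the Python standard library alone. The corpus, the stop-word list, and the function names below are all hypothetical, and the TF-IDF weighting used (term frequency times log(N / document frequency)) is one common variant, not necessarily the one used in any given project. Stemming and lemmatization are left out because they normally rely on an NLP library.

```python
import math
import re
from collections import Counter

# Hypothetical mini-corpus and stop-word list, for illustration only.
documents = [
    "Book My Show is the best website to book a show!",
    "The show was good, and the website is fast.",
    "We booked tickets on this website.",
]
stop_words = {"is", "the", "and", "to", "a", "we", "this", "was", "on", "my"}

def preprocess(text):
    """Lowercase, keep alphabetic tokens only (drops punctuation), remove stop words."""
    tokens = re.findall(r"[a-z]+", text.lower())
    return [t for t in tokens if t not in stop_words]

def tf_idf(token_lists):
    """Return one {term: score} dict per document using tf * log(N / df)."""
    n_docs = len(token_lists)
    doc_freq = Counter(term for tokens in token_lists for term in set(tokens))
    scores = []
    for tokens in token_lists:
        counts = Counter(tokens)
        scores.append({
            term: (count / len(tokens)) * math.log(n_docs / doc_freq[term])
            for term, count in counts.items()
        })
    return scores

token_lists = [preprocess(doc) for doc in documents]
scores = tf_idf(token_lists)
top_terms = [max(s, key=s.get) for s in scores]

print(token_lists[0])  # ['book', 'show', 'best', 'website', 'book', 'show']
print(top_terms[0])    # book
```

A word such as "website" appears in every document, so its score is zero; that is the point of the IDF term, which down-weights words that do not distinguish one document from another.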
- How many bi-grams can be generated from the given sentence?
“Sachin Tendulkar is the best batsman in the World”
The sentence has 9 words, so it yields 9 - 1 = 8 bi-grams:
Sachin Tendulkar, Tendulkar is, is the, the best, best batsman, batsman in, in the, the World
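The same list can be generated in a couple of lines by pairing each word with the word that follows it. This is a minimal sketch that assumes whitespace tokenization; a sentence of n words yields n - 1 bi-grams.

```python
sentence = "Sachin Tendulkar is the best batsman in the World"
words = sentence.split()

# Pair each word with its successor; zip stops at the shorter sequence.
bigrams = [" ".join(pair) for pair in zip(words, words[1:])]

print(len(words), len(bigrams))  # 9 8
print(bigrams)
```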