## Data leakage

Question
In which situations does cross-validation fall into the trap of data leakage?
0
Anonymous 2 weeks 0 Answers 16 views
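
One common situation is fitting a preprocessing step (scaling, imputation, feature selection) on the full dataset before the folds are split, so every validation fold has already influenced the transform. A minimal sketch, assuming scikit-learn and a synthetic dataset from `make_classification` (both are illustrative choices, not part of the question), keeps the scaler inside a pipeline so that it is refitted on the training part of each fold:

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# Synthetic data, used only for illustration.
X, y = make_classification(n_samples=200, n_features=10, random_state=0)

# The scaler lives inside the pipeline, so cross_val_score fits it
# on the training folds only and never sees the validation fold.
pipeline = make_pipeline(StandardScaler(), LogisticRegression(max_iter=1000))
scores = cross_val_score(pipeline, X, y, cv=5)

print(scores.mean())
```

Calling `StandardScaler().fit_transform(X)` on all rows first and then cross-validating the model alone would be the leaky variant.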

## Encoding behavior

Question
Why are tree-based algorithms less likely to be affected by label encoding? If a random forest is deep enough, how can it handle categorical variables without one-hot encoding?
0
3 weeks 0 Answers 20 views Newbie

## Standardization or Scaling?

Question
How do we decide whether standardization or scaling of the data is better without using cross-validation techniques? Will it depend on the algorithm we are using (distance-based or not), or do we need to dig deeper ...
0
Anonymous 3 weeks 0 Answers 19 views

## accuracy of linear regression model

Question
Can you name a possible method of improving the accuracy of a linear regression model?
in progress 0
2 months 1 Answer 163 views Member

## error minimization techniques

Question
How do linear and logistic regression differ in their error minimization techniques?
in progress 0
2 months 1 Answer 116 views Member

## linear regression output as probabilities

Question
Why shouldn’t you use linear regression outputs as probabilities?
in progress 0
2 months 1 Answer 129 views Member

## Topic Modeling

Question
In the document-term matrix passed to LDA topic modeling, what do the columns refer to?
in progress 0
2 months 0 Answers 70 views Newbie

## Find the 4th Highest employee salary from the following table

Question
Find the 4th highest salary from the given employee table
in progress 0
2 months 0 Answers 163 views Newbie
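
The table itself is not shown, so the following is only a sketch under assumptions: a hypothetical `employee` table with `name` and `salary` columns, created in an in-memory SQLite database. It treats tied salaries as one rank by using `DISTINCT`, then skips the top three values:

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Hypothetical schema and rows; the real table is not given in the question.
conn.execute("CREATE TABLE employee (name TEXT, salary INTEGER)")
conn.executemany(
    "INSERT INTO employee VALUES (?, ?)",
    [("a", 50000), ("b", 60000), ("c", 70000),
     ("d", 80000), ("e", 90000), ("f", 90000)],
)

# Sort distinct salaries from highest to lowest and skip the first three.
row = conn.execute(
    "SELECT DISTINCT salary FROM employee "
    "ORDER BY salary DESC LIMIT 1 OFFSET 3"
).fetchone()
fourth_highest = row[0] if row else None

print(fourth_highest)
```

`LIMIT ... OFFSET` is SQLite/MySQL/PostgreSQL syntax; other databases may need `DENSE_RANK()` or `TOP` instead.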

## linear vs logistic regression

Question
Why can't linear regression be used in place of logistic regression for binary classification?
solved 0
2 months 1 Answer 72 views Member

## logistic regression

Question
What is a decision boundary?
solved 0
2 months 1 Answer 58 views Member

## You have uploaded a dataset in CSV format to Google Sheets and shared it publicly. How can you access it in Python?

Question
You have uploaded a dataset in CSV format to Google Sheets and shared it publicly. You want to access it in Python. How can you do this?
in progress 0
2 months 1 Answer 52 views Member
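
One possible approach is to read the sheet through its CSV export URL. The sketch below assumes the sheet is shared so that anyone with the link can view it; `YOUR_SHEET_ID` is a placeholder, and the helper names `csv_export_url` and `load_public_sheet` are made up for this example:

```python
def csv_export_url(sheet_id: str, gid: int = 0) -> str:
    """Build the CSV export URL for one tab (gid) of a public Google Sheet."""
    return (
        f"https://docs.google.com/spreadsheets/d/{sheet_id}"
        f"/export?format=csv&gid={gid}"
    )


def load_public_sheet(sheet_id: str, gid: int = 0):
    """Download the sheet into a DataFrame (needs pandas and network access)."""
    import pandas as pd  # imported here so building the URL does not need pandas

    return pd.read_csv(csv_export_url(sheet_id, gid))


# Placeholder ID; replace it with the ID from the sheet's own URL.
url = csv_export_url("YOUR_SHEET_ID")
print(url)
```

The sheet ID is the long token between `/d/` and `/edit` in the browser address bar.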

## How to identify whether given data is structured or unstructured

Question
How do you tell whether given data is structured or unstructured? 1. Let us assume I have been given 20k rows and 3 columns: the first column is review text, the second column is rating, and the third column is sentiment class. 2. In sentiment ...
in progress 0
2 months 1 Answer 47 views Member

## How to convert a string into a datetime value

Question
Write code to convert a string into a datetime value.
in progress 0
2 months 1 Answer 44 views Member
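
A minimal sketch using the standard library, assuming the string follows the `YYYY-MM-DD HH:MM:SS` layout (the sample value is made up; the format string has to match the actual input):

```python
from datetime import datetime

text = "2021-03-15 14:30:00"  # sample input, assumed format

# strptime parses the string according to the given format codes.
parsed = datetime.strptime(text, "%Y-%m-%d %H:%M:%S")

print(parsed.year, parsed.month, parsed.day)
```

For a whole column of strings in pandas, `pd.to_datetime` plays the same role.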

## What are precision and recall?

Question
What are precision and recall? Explain in detail with a practical scenario.
in progress 0
2 months 1 Answer 49 views Member
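
As a rough illustration, take a spam filter where label 1 means "spam". Precision is the share of messages flagged as spam that really are spam; recall is the share of real spam that was flagged. The labels below are invented for the example:

```python
def precision_recall(y_true, y_pred, positive=1):
    """Compute precision and recall for the given positive label."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return precision, recall


# Invented labels: 1 = spam, 0 = not spam.
y_true = [1, 1, 1, 1, 0, 0, 0, 0]
y_pred = [1, 1, 0, 0, 1, 0, 0, 0]

precision, recall = precision_recall(y_true, y_pred)
print(precision, recall)
```

Here two of the three flagged messages are spam (precision 2/3), and two of the four spam messages were caught (recall 1/2).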

## Explain the Confusion Matrix in Machine Learning

Question
Explain the confusion matrix in detail with a practical scenario.
in progress 0
2 months 1 Answer 41 views Member
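
A small sketch of how the matrix is built, using an invented medical-screening example. Rows are the actual classes and columns are the predicted classes, which is one common convention (some texts transpose it):

```python
from collections import Counter


def confusion_matrix(y_true, y_pred, labels):
    """Return a matrix where rows are actual labels and columns are predictions."""
    counts = Counter(zip(y_true, y_pred))
    return [[counts[(actual, predicted)] for predicted in labels]
            for actual in labels]


# Invented screening results.
y_true = ["sick", "sick", "healthy", "healthy", "sick", "healthy"]
y_pred = ["sick", "healthy", "healthy", "sick", "sick", "healthy"]

matrix = confusion_matrix(y_true, y_pred, labels=["sick", "healthy"])
for row in matrix:
    print(row)
```

The diagonal holds the correct predictions; the off-diagonal cells are the sick patients predicted healthy and the healthy patients predicted sick.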

## Feature Selection

Question
Two people, A and B, train an algorithm on the same set of data. A trains by selecting the 10 most important features from the dataset using an algorithm C and randomly splits the dataset into training and testing sets. B ...
in progress 0
2 months 1 Answer 65 views Member

## Predict who would visit your mall this month?

Question
A shopping mall owner has month-wise data on all the customers who visited in 2019 (say, for January 2019, the data would have the names of everyone who visited the mall at least once). How can you predict how many ...
in progress 0
2 months 1 Answer 63 views Member

## Can we use R-squared to validate our model?

Question
Can we use R-squared to validate our model? When does the R2 score metric make sense?
in progress 0
2 months 1 Answer 88 views Member

## Decision Trees

Question
Why do decision trees and their ensembles have such strong predictive power, and why are they prone to overfitting the dataset?
in progress 0
2 months 1 Answer 43 views Member

## Which test is used for the location parameters of more than two independent populations?

Question
Options:
- F-test (ANOVA)
- Kruskal-Wallis
- Both
in progress 0
2 months 1 Answer 62 views Member

## Assume the name and columns of the table. Given information about different car names and details of each car, such as body_style, average_milage, and price, count the total cars per company.

Question
df['company'].value_counts()
in progress 0
2 months 0 Answers 68 views Member
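
To make the one-liner concrete, here is a sketch with a small invented DataFrame; only the `company` column name comes from the post, the other column and all values are assumptions:

```python
import pandas as pd

# Invented sample data; the real table is not shown in the question.
df = pd.DataFrame({
    "company": ["toyota", "audi", "toyota", "bmw", "audi", "toyota"],
    "price": [9000, 17000, 11000, 21000, 18000, 10000],
})

# value_counts returns the number of rows per distinct company,
# sorted from most to least frequent.
cars_per_company = df["company"].value_counts()

print(cars_per_company)
```

`df.groupby("company").size()` gives the same counts if further aggregation per company is needed.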

## Does Python support a switch or case statement? Why?

Question
Ans: Python does not have a switch or case statement like other languages; since Python 3.10, the match statement provides structural pattern matching.
in progress 0
2 months 0 Answers 42 views Member
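
A common substitute that works on every Python 3 version is a dictionary lookup with a default value. The function name and the day mapping below are made up for illustration; Python 3.10 and later could express the same thing with a `match` statement:

```python
def describe_day(day_number):
    """Map a day number to its name, like a switch with a default branch."""
    names = {
        1: "Monday",
        2: "Tuesday",
        3: "Wednesday",
    }
    # dict.get returns the fallback when the key is missing,
    # which plays the role of the "default" case.
    return names.get(day_number, "Unknown")


print(describe_day(2))
print(describe_day(9))
```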

## Difference Between Linear and Logistic Regression.

Question
Ans: Linear regression is a supervised learning regression algorithm. Logistic regression is also supervised learning, but it is a classification algorithm.
0
2 months 0 Answers 30 views Member

## List the total number of products of each brand.

Question
Ans: select count(Product_Brand), Product_Brand from Product_Master group by Product_Brand
in progress 0
2 months 1 Answer 80 views Member
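
The query can be tried out against an in-memory SQLite database. In this sketch the `Product_Name` column and all rows are assumptions; only `Product_Master` and `Product_Brand` come from the post:

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Product_Name and the rows are invented for the example.
conn.execute(
    "CREATE TABLE Product_Master (Product_Name TEXT, Product_Brand TEXT)"
)
conn.executemany(
    "INSERT INTO Product_Master VALUES (?, ?)",
    [("p1", "Acme"), ("p2", "Acme"), ("p3", "Zenith")],
)

# One output row per brand, with the number of products in that brand.
rows = conn.execute(
    "SELECT Product_Brand, COUNT(*) AS total_products "
    "FROM Product_Master "
    "GROUP BY Product_Brand "
    "ORDER BY Product_Brand"
).fetchall()

print(rows)
```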

## Is logistic regression a regression technique?

Question
Logistic regression is used to solve classification problems. So what is the reason why it is called a regression? What is the link here?
in progress 0
2 months 1 Answer 40 views Member

## python for datascience

Question
Why do we have to use reshape(-1,1) or reshape(1,-1) for a single feature before fitting it to a model in the sklearn library?
in progress 0
2 months 2 Answers 39 views Member
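
scikit-learn estimators expect the feature matrix to be two-dimensional, with shape `(n_samples, n_features)`. A single feature stored as a 1-D array has no column axis, so it has to be reshaped. A short NumPy sketch with made-up values:

```python
import numpy as np

x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])  # 1-D: shape (5,)

# Five samples with one feature each: the usual case for one column of data.
X_column = x.reshape(-1, 1)

# One sample with five features: used when predicting for a single row.
X_row = x.reshape(1, -1)

print(x.shape, X_column.shape, X_row.shape)
```

The `-1` tells NumPy to infer that dimension from the array's length.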

## pandas

Question
import pandas as pd; data = {'a': 1, 'b': 2, 'c': 3, 'd': 4}; series = pd.Series(data); dummy = series.copy(deep=True); print(dummy)

The above code makes a copy of a Series. What happens when the argument deep is set to False?
in progress 0
2 months 1 Answer 33 views Member
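
A small sketch of the difference, using the Series from the question. With `deep=True` the copy gets its own data, so writing to it leaves the original untouched. With `deep=False` only references to the data and index are copied; whether a later write to the shallow copy shows up in the original depends on the pandas version (Copy-on-Write changes this behavior), so the sketch deliberately does not write to it:

```python
import pandas as pd

data = {"a": 1, "b": 2, "c": 3, "d": 4}
series = pd.Series(data)

# Deep copy: independent data, so this write does not affect `series`.
deep_copy = series.copy(deep=True)
deep_copy["a"] = 100

# Shallow copy: a new Series object that initially refers to the same data.
shallow_copy = series.copy(deep=False)

print(series["a"], deep_copy["a"], shallow_copy["a"])
```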

## logistic regression

Question
Can we apply logistic regression for this given data?
in progress 0
2 months 1 Answer 37 views Member

## linear regression

Question
A linear model has a training error of 3% and a testing error of 20%. Is the model underfitted or overfitted?
in progress 0
2 months 1 Answer 30 views Member

## logistic regression

Question
A trained logistic model is represented by the equation 1/(1 + exp(-(c + a0x0 + a1x1 + a2x2))). How many predictors are used to create this model?
in progress 0
2 months 1 Answer 32 views Member
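
Reading the equation as code makes the count visible: `c` is the intercept, and each of `a0`, `a1`, `a2` multiplies one predictor (`x0`, `x1`, `x2`). The coefficient values below are invented purely to make the sketch runnable:

```python
import math


def predict_probability(x, coefficients, intercept):
    """Evaluate 1 / (1 + exp(-(c + a0*x0 + a1*x1 + a2*x2)))."""
    z = intercept + sum(a * xi for a, xi in zip(coefficients, x))
    return 1.0 / (1.0 + math.exp(-z))


coefficients = [0.5, -1.0, 2.0]  # a0, a1, a2 (invented values)
intercept = 0.0                  # c (invented value)

# One coefficient per predictor, so the model uses three predictors.
n_predictors = len(coefficients)
p = predict_probability([0.0, 0.0, 0.0], coefficients, intercept)

print(n_predictors, p)
```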

## python

Question
re.search('^From:', line) What will the above line of code do?
solved 0
2 months 1 Answer 56 views Member
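
The `^` anchors the pattern to the start of the string, so the call returns a match object only when `line` begins with `From:` and returns `None` otherwise. A short sketch with made-up lines:

```python
import re

# Invented sample lines, e.g. from a mailbox file.
lines = [
    "From: alice@example.com",
    "Subject: hello",
    "Reply-From: bob@example.com",
]

# Keep only the lines that start with "From:".
matching = [line for line in lines if re.search("^From:", line)]

print(matching)
```

The third line contains `From:` but not at the beginning, so it is not selected.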

## Clustering algorithms

Question
How do you cluster unlabeled data where all the attributes and their values are categorical?
in progress 0
2 months 1 Answer 47 views Newbie

## python

Question
uh = urllib.request.urlopen(url, context=ctx) Which library module must be imported in the Python code to execute the above line, which opens a URL?
solved 0
2 months 1 Answer 47 views Member
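
The call lives in the standard-library module `urllib.request`. The post does not show how `ctx` is created; the sketch below assumes it is an SSL context from the `ssl` module, and the wrapper name `open_url` is made up for the example:

```python
import ssl
import urllib.request

# Assumption: ctx is an SSL context with default certificate checks.
ctx = ssl.create_default_context()


def open_url(url):
    """Open the URL and return the response object (needs network access)."""
    return urllib.request.urlopen(url, context=ctx)
```

Calling `open_url("https://example.com").read()` would then return the response body as bytes.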

## SQL

Question
Which join is used to join a table with itself: inner join, full join, or self join? Does the virtual table created for joining the table with itself occupy space?
solved 0
2 months 1 Answer 35 views Member

## What is CNN?

Question
A convolution is the simple application of a filter to an input that results in an activation. Repeated application of the same filter to an input results in a map of activations called a feature map, indicating the locations and strength of a ...
in progress 0
2 months 0 Answers 35 views Member

## What is the difference between machine learning and deep learning?

Question
Machine Learning | Deep learning Machine Learning is a technique to learn from that data and then apply what has been learned to make an informed decision | The main difference between deep and machine learning is, machine learning models ...
0
2 months 0 Answers 33 views Member

## What are the statistical tests for data validation, with examples: chi-square, ANOVA, Z-statistic, t-statistic, F-statistic, hypothesis testing?

Question
Before discussing the different statistical tests, we need to get a clear understanding of what a null hypothesis is. A null hypothesis proposes that no significant difference exists in a given set of observations. Null: Two samples' mean is ...
0
2 months 0 Answers 33 views Member

## SQL Query

Question
What is the correct order of the clauses when writing an SQL query, given select, where, group by, having, from, and order by?
solved 0
2 months 1 Answer 35 views Member

## SQL Query

Question
| ORG | counts |
|---|---|
| whitman.edu | 17 |
| vt.edu | 110 |
| utoronto.ca | 1 |
| unicon.net | 9 |
| umich.edu | 491 |
| ufp.pt | 28 |
| uct.ac.za | ... |
solved 0
2 months 1 Answer 34 views Member

## What do you understand by Type I and Type II errors?

Question
Type I Error: Type I error (False Positive) is an error where the outcome of a test shows the non-acceptance of a true condition. For example, a cricket match is going on and, when a batsman is not out, the umpire ...
0
2 months 0 Answers 31 views Member

## What is a Confusion Matrix?

Question
A confusion matrix is used to explain a model’s performance and gives a summary of predictions on classification problems. It assists in identifying the uncertainty between classes. A confusion matrix gives the count of correct and incorrect values and also the ...
0
2 months 0 Answers 30 views Member

## What are Bias and Variance?

Question
Bias is the difference between the average prediction of our model and the correct value. If the bias value is high, then the prediction of the model is not accurate. Variance is the number that gives the difference of prediction over a training ...
0
2 months 0 Answers 27 views Member

## Differentiate between classification and regression in Machine Learning.

Question
In Machine Learning, there are various types of prediction problems based on supervised and unsupervised learning. These are classification, regression, clustering, and association. Here, we will discuss classification and regression. Classification: In classification, we try to create a Machine Learning model ...
0
2 months 0 Answers 29 views Member

## what are the types of Machine Learning?

Question
So basically there are 3 types of techniques: Supervised Learning: In this type of the Machine Learning technique, machines learn under the supervision of labeled data. Unsupervised Learning: Unlike supervised learning, it has unlabeled data. So, there is no supervision under which it works ...
0
2 months 0 Answers 31 views Member

## All things you need to know about Tensorflow.

Question
TensorFlow: TensorFlow is an open-source software library released in 2015 by Google to make it easier for developers to design, build, and train deep learning models. TensorFlow originated as an internal library that Google developers used to ...
0
2 months 0 Answers 31 views Member

## What will be the output of the following SQL query?

Question
What will be the output of the following SQL query? SELECT name, last_name, salary FROM Salary WHERE salary > (SELECT AVG(salary) FROM Salary);
in progress 0
2 months 1 Answer 54 views Newbie
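
The subquery computes the average salary once, and the outer query returns every row whose salary is above that average. A sketch against an in-memory SQLite database, with invented rows since the table contents are not shown:

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE Salary (name TEXT, last_name TEXT, salary INTEGER)"
)
# Invented rows; the average salary here is 60000.
conn.executemany(
    "INSERT INTO Salary VALUES (?, ?, ?)",
    [("Ann", "Lee", 40000), ("Bob", "Ray", 60000), ("Cal", "Fox", 80000)],
)

rows = conn.execute(
    "SELECT name, last_name, salary FROM Salary "
    "WHERE salary > (SELECT AVG(salary) FROM Salary)"
).fetchall()

print(rows)
```

Only the employee earning more than the average is returned; a salary exactly equal to the average is excluded because the comparison is strict.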

## What are the various aspects of a Machine Learning process?

Question
Here we will discuss the components involved in solving a problem using machine learning. 1. Domain knowledge This is the first step wherein we need to understand how to extract the various features from the data and learn more about the ...
0
2 months 0 Answers 34 views Member