How to select number of trees in Random forest?
Question
Answers ( 7 )
One technique is to use GridSearchCV() in scikit-learn, where you tune the n_estimators parameter to find the right number of trees. You also have to pass an adequate list of candidate values for n_estimators.
Example – n_estimators = [10, 30, 100]
Typical starting values are 10, 30, or 100.
Passing too few trees will not actually give you the benefit of the Random Forest method, because you lose the advantage of building a large number of trees and averaging their outputs.
On the other hand, creating far more trees than required will increase the training time, and beyond a certain limit you will not get substantial gains in accuracy.
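As a minimal sketch of the approach above (the dataset and candidate values here are illustrative, not from the original answer):

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import GridSearchCV

# Toy dataset standing in for your own data.
X, y = make_classification(n_samples=500, random_state=42)

# Candidate tree counts, as in the answer's example list.
param_grid = {"n_estimators": [10, 30, 100]}

search = GridSearchCV(RandomForestClassifier(random_state=42),
                      param_grid, cv=5)
search.fit(X, y)
print(search.best_params_["n_estimators"])
```

GridSearchCV fits one forest per candidate value per fold, so keep the candidate list small when the dataset is large.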
There is no hard rule, but here is how I choose the number of trees in tree-based algorithms.
I start with the default, generally 100 trees. If the dataset is small, I tune n_estimators using GridSearchCV or a Bayesian optimization method. If the dataset is very large, then rather than tuning it, I try some large number of trees like 1000, 2000, 5000, 100000, etc., and use early stopping to handle overfitting.
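Random Forests in scikit-learn have no built-in early stopping, but a similar effect can be sketched with warm_start and the out-of-bag score: grow the forest in chunks and stop once the OOB score stops improving. The chunk size and improvement threshold below are illustrative assumptions, not part of the original answer.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier

# Toy dataset standing in for a real one.
X, y = make_classification(n_samples=500, random_state=0)

# warm_start=True keeps existing trees and only adds new ones on refit;
# oob_score=True gives a validation-like score without a holdout set.
rf = RandomForestClassifier(warm_start=True, oob_score=True, random_state=0)

best_oob, best_n, stall = 0.0, 0, 0
for n in range(25, 301, 25):        # grow in chunks of 25 trees
    rf.set_params(n_estimators=n)
    rf.fit(X, y)
    if rf.oob_score_ > best_oob + 1e-3:
        best_oob, best_n, stall = rf.oob_score_, n, 0
    else:
        stall += 1
        if stall >= 3:              # no meaningful OOB gain: stop early
            break
print(best_n)
```

This trades a few refits for not having to guess the tree count up front.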
We can use cross-validation techniques like GridSearchCV or RandomizedSearchCV to find these hyperparameters.
I use the GridSearchCV cross-validation technique and tune the n_estimators parameter.
I use early stopping with a large number of trees to handle overfitting. This seems best to me.
In a Random Forest, the more trees you build, the more bootstrap samples of your data you create, and the more samples you create, the more you reduce the variance of your predictions. But a point comes where you have enough samples, and further samples are essentially duplicating the data. To find the optimal number of trees, we can use cross-validation techniques like GridSearchCV or RandomizedSearchCV to tune the n_estimators hyperparameter.
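A minimal sketch of the randomized alternative mentioned above; instead of trying every candidate value, RandomizedSearchCV samples a few at random. The candidate list and n_iter below are illustrative assumptions.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import RandomizedSearchCV

# Toy dataset standing in for your own data.
X, y = make_classification(n_samples=400, random_state=1)

# Sample 4 of these candidate tree counts rather than trying all 6.
search = RandomizedSearchCV(
    RandomForestClassifier(random_state=1),
    param_distributions={"n_estimators": [10, 25, 50, 100, 150, 200]},
    n_iter=4, cv=3, random_state=1)
search.fit(X, y)
print(search.best_params_["n_estimators"])
```

Randomized search scales better than grid search when several hyperparameters are tuned at once.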
I would prefer either GridSearchCV or RandomizedSearchCV.