- Artificial Intelligence
- Data
- Data Science
- Features
- Generative AI
- Machine Learning
-
Modeling
- Autopilot Mode
- Classification
- Confusion Matrix
- Cross-Validation
- Deep Learning Algorithms
- Machine Learning Model
- Machine Learning Model Accuracy
- Machine Learning Model Deployment
- Model Blueprint
- Model Fitting
- Model Interpretability
- Model Tuning
- Multiclass Classification
- Neural Network
- Open Source Model Infrastructure
- Overfitting
- Regression
- Training Sets, Validation Sets, and Holdout Sets
- Underfitting
- Predictions
- View global site search results
- A
- AI Engineer
- AI Observability
- AIOps
- Artificial Intelligence Wiki
- Automated Machine Learning
- Autopilot Mode
- B
- Big Data
- C
- Citizen Data Scientist
- Classification
- Cognitive Computing
- Confusion Matrix
- Cross-Validation
- D
- Data Collection
- Data Governance
- Data Insights
- Data Management
- Data Preparation
- Data Profiling
- Data Science
- Deep Learning Algorithms
- E
- Explainable AI
- F
- Feature Engineering
- Feature Impact
- Feature Selection
- Feature Variables
- G
- Generative AI
- L
- Large Language Model Operations (LLMOps)
- M
- Machine Learning
- Machine Learning Algorithms
- Machine Learning Life Cycle
- Machine Learning Model
- Machine Learning Model Accuracy
- Machine Learning Model Deployment
- Machine Learning Operations (MLOps)
- Model Blueprint
- Model Fitting
- Model Interpretability
- Model Monitoring
- Model Tuning
- Multiclass Classification
- N
- Natural Language Processing
- Neural Network
- O
- Open Source Model Infrastructure
- Overfitting
- P
- Prediction
- Prediction Explanations
- Predictive Maintenance
- Production Model Governance
- Production Model Lifecycle Management
- R
- Regression
- S
- Scoring Data
- Semi-Supervised Machine Learning
- Stacked Predictions
- Supervised Machine Learning
- T
- Target Leakage
- Target Variable
- Text Mining
- Training Sets, Validation Sets, and Holdout Sets
- U
- Underfitting
- Unsupervised Machine Learning
- W
- What is Artificial Intelligence (AI)?
- View global site search results
- Artificial Intelligence
- Data
- Data Science
- Features
- Generative AI
- Machine Learning
-
Modeling
- Autopilot Mode
- Classification
- Confusion Matrix
- Cross-Validation
- Deep Learning Algorithms
- Machine Learning Model
- Machine Learning Model Accuracy
- Machine Learning Model Deployment
- Model Blueprint
- Model Fitting
- Model Interpretability
- Model Tuning
- Multiclass Classification
- Neural Network
- Open Source Model Infrastructure
- Overfitting
- Regression
- Training Sets, Validation Sets, and Holdout Sets
- Underfitting
- Predictions
Regression
What is Regression?
Regression, one of the most common types of machine learning models, estimates the relationships between variables. Whereas classification models identify which category an observation belongs to, regression models estimate a numeric value.
In the context of machine learning and data science, regression specifically refers to the estimation of a continuous dependent variable or response from a list of input variables, or features. There are a variety of regression techniques, ranging from the simplest (linear regression), to complicated statistical classic regression models (Lasso, Elastic Net, etc.), to more complex techniques including gradient boosting and neural networks.
Why is Regression Important?
Regression is essential for any machine learning problem that involves continuous numbers, which includes a vast array of real-life applications:
- Financial forecasting, such as estimating housing or stock prices
- Automobile testing
- Weather analysis
- Time series forecasting
Regression + DataRobot
Although regression is one of the most common algorithms, a lot of manual work still goes into creating a regression model with traditional data science techniques and tools. The DataRobot AI platform automates regression analysis for datasets with the touch of a few buttons.
Based on the target variable in the dataset, the DataRobot AI platform automatically decides whether the task is best suited for regression or classification. It also provides error metrics and parameters critical to regression analysis and visualization tools that help users (and their bosses) understand the model’s outcomes and insights.