Linear regression for micro-entrepreneurs
A bit more about the case
The database consisted of 964 Individual Microentrepreneurs (MEIs) and 76 variables. After modeling, we observed through the T-test that many variables are considered irrelevant. Among the 76 variables, only 5 showed significance. Using the ANOVA table, we concluded that only the municipality and experience influence the annual revenue. Another measure of fit quality is the coefficient of determination, also known as R^2. This measure ranges from 0 to 1; the closer to 1, the more of Y's variation is explained linearly by the independent variables Xi. In this fit, the R^2 was 0.09437, meaning the model didn't explain the annual revenue very well. So, we concluded that these variables are not very effective in explaining a company's revenue.