Skip to main content

Unanswered Questions

1,872 questions with no upvoted or accepted answers
8 votes
0 answers
652 views

Calculate goodness-of-fit (with deviance) to compare averaged models?

I need to compare the goodness of fit of several averaged logistic regression models by calculating the deviance explained. I'm using the MuMIn package in R to ...
7 votes
1 answer
2k views

Overfitting a neural network to a single batch as a sanity check - how small a loss value is small enough and long to run for?

I'm currently developing a neural network for a regression task. Following on from the advice given in places like here, here, and here I'm attempting to overfit my model to a single batch of 5 ...
7 votes
0 answers
182 views

Identifying non-linearities in relationship between variables

Logistic regression is often used to identify the effect of $x$ on a binary variable $y$ after adjusting for potential confounders $x_1,...,x_n$. In the medical literature, I will sometimes encounter ...
7 votes
0 answers
2k views

What are the differences between HC estimators and their small sample properties?

I am currently using R to run regression with the following code: ...
6 votes
0 answers
78 views

Before using CV-selected Regression model for Inference, shouldn't model performance be evaluated on unused test set?

I just came across a biokinesiology paper that used some Machine Learning methods, but I think there is a flaw in their methodology. The authors had data on stroke patients and used Lasso regression ...
6 votes
0 answers
150 views

using `lmer` to fit the linear mixed effects models

Edit: I know some people vote this question is off-topic since it is more like a Cross Validated question. However, I am not here to ask about the coding thing (but I might word in the wrong way). I ...
6 votes
1 answer
113 views

Importance of regressors in time series data

Could anyone recommend bibliography or name some useful methods to analyze which (exogenous) variables are most important in determining the value of a time series? For context, I have a random time ...
6 votes
1 answer
3k views

Comparing observed and predicted values across several measurements

I am investigating whether a medical treatment has an effect on cognitive measures (aside from curing the medical problem). I have been looking for a statistically sound method to approach the ...
5 votes
1 answer
496 views

How to train Prediction model for longitudinal data, with large number of time points?

Given a longitudinal data, that has date (in month-year format) as one of the independent variables and other independent variables being Gross metric tonnes, Tensile strength(UTS), weight per unit ...
5 votes
0 answers
358 views

Equivalence of ANCOVA and repeated measures model

Consider an RCT with individuals i in 2 arms (group, with 0 = control and 1 = treatment) in which one metric outcome (score) is collected at baseline (pre) and after some treatment (post). In an ...
5 votes
0 answers
321 views

How can I find and categorise the effect size of a single coefficient in a multiple regression?

Question How do I find the effect size for the different hierarchical multiple-level regressions used by papers in my review? And how do I categorise their effect size? Detail I’m publishing a ...
5 votes
0 answers
2k views

Covariance matrix of multivariate multiple regression coefficients

I would like to perform a regression analysis on a dataset comprising one independent variable (X) and two dependent variables (Y1 and Y2) which may be affected by correlated errors. R's stats::lm ...
5 votes
0 answers
4k views

Is it possible to do a time series analysis with more than one explanatory variable?

I am working on a project, and I am absolutely new to forecasting and not so strong in statistics. I have an employee data for the last 7 years, along with the other variables like economic growth, ...
5 votes
0 answers
113 views

Testing differences between slopes by examining proportions rather than interactions

In a recent article published in Nature Genetics, Francioli et al. argue that the relationship between paternal age and mutation rate depends on what part of the genome you're looking at (intergenic ...
5 votes
0 answers
1k views

Why would I need both a validation set & a test set if I'm not selecting a model?

I have a dataset with two features and one outcome. I was asked to separate the data into three parts such that 70% of the data is a training set, 20% is for validation and 10% for testing. The model ...

15 30 50 per page
1
2 3 4 5
125