Unanswered Questions
1,872 questions with no upvoted or accepted answers
8
votes
0
answers
652
views
Calculate goodness-of-fit (with deviance) to compare averaged models?
I need to compare the goodness of fit of several averaged logistic regression models by calculating the deviance explained. I'm using the MuMIn package in R to ...
7
votes
1
answer
2k
views
Overfitting a neural network to a single batch as a sanity check - how small a loss value is small enough and long to run for?
I'm currently developing a neural network for a regression task. Following on from the advice given in places like here, here, and here I'm attempting to overfit my model to a single batch of 5 ...
7
votes
0
answers
182
views
Identifying non-linearities in relationship between variables
Logistic regression is often used to identify the effect of $x$ on a binary variable $y$ after adjusting for potential confounders $x_1,...,x_n$. In the medical literature, I will sometimes encounter ...
7
votes
0
answers
2k
views
What are the differences between HC estimators and their small sample properties?
I am currently using R to run regression with the following code:
...
6
votes
0
answers
78
views
Before using CV-selected Regression model for Inference, shouldn't model performance be evaluated on unused test set?
I just came across a biokinesiology paper that used some Machine Learning methods, but I think there is a flaw in their methodology.
The authors had data on stroke patients and used Lasso regression ...
6
votes
0
answers
150
views
using `lmer` to fit the linear mixed effects models
Edit: I know some people vote this question is off-topic since it is more like a Cross Validated question. However, I am not here to ask about the coding thing (but I might word in the wrong way). I ...
6
votes
1
answer
113
views
Importance of regressors in time series data
Could anyone recommend bibliography or name some useful methods to analyze which (exogenous) variables are most important in determining the value of a time series?
For context, I have a random time ...
6
votes
1
answer
3k
views
Comparing observed and predicted values across several measurements
I am investigating whether a medical treatment has an effect on cognitive measures (aside from curing the medical problem). I have been looking for a statistically sound method to approach the ...
5
votes
1
answer
496
views
How to train Prediction model for longitudinal data, with large number of time points?
Given a longitudinal data, that has date (in month-year format) as one of the independent variables and other independent variables being Gross metric tonnes, Tensile strength(UTS), weight per unit ...
5
votes
0
answers
358
views
Equivalence of ANCOVA and repeated measures model
Consider an RCT with individuals i in 2 arms (group, with 0 = control and 1 = treatment) in which one metric outcome (score) is collected at baseline (pre) and after some treatment (post).
In an ...
5
votes
0
answers
321
views
How can I find and categorise the effect size of a single coefficient in a multiple regression?
Question
How do I find the effect size for the different hierarchical multiple-level regressions used by papers in my review? And how do I categorise their effect size?
Detail
I’m publishing a ...
5
votes
0
answers
2k
views
Covariance matrix of multivariate multiple regression coefficients
I would like to perform a regression analysis on a dataset comprising one independent variable (X) and two dependent variables (Y1 and Y2) which may be affected by correlated errors. R's stats::lm ...
5
votes
0
answers
4k
views
Is it possible to do a time series analysis with more than one explanatory variable?
I am working on a project, and I am absolutely new to forecasting and not so strong in statistics. I have an employee data for the last 7 years, along with the other variables like economic growth, ...
5
votes
0
answers
113
views
Testing differences between slopes by examining proportions rather than interactions
In a recent article published in Nature Genetics, Francioli et al. argue that the relationship between paternal age and mutation rate depends on what part of the genome you're looking at (intergenic ...
5
votes
0
answers
1k
views
Why would I need both a validation set & a test set if I'm not selecting a model?
I have a dataset with two features and one outcome. I was asked to separate the data into three parts such that 70% of the data is a training set, 20% is for validation and 10% for testing. The model ...