Unanswered Questions
2,518 questions with no upvoted or accepted answers
9
votes
0
answers
5k
views
Is the Naive Bayes family of classifiers linear?
There are a lot of places where you'll see the proof that Naive Bayes classifiers are linear, like this and this. But they always assume a special case of the family of Naive Bayes classifiers which ...
8
votes
0
answers
652
views
Calculate goodness-of-fit (with deviance) to compare averaged models?
I need to compare the goodness of fit of several averaged logistic regression models by calculating the deviance explained. I'm using the MuMIn package in R to ...
8
votes
0
answers
9k
views
Inverse Mills ratio after OLS
Short version of the question:
Is it possible to create a dependent variable in the first step of the Heckman Selection model such that it is possible to obtain the values for the calculation of the ...
7
votes
1
answer
2k
views
Overfitting a neural network to a single batch as a sanity check - how small a loss value is small enough and long to run for?
I'm currently developing a neural network for a regression task. Following on from the advice given in places like here, here, and here I'm attempting to overfit my model to a single batch of 5 ...
7
votes
1
answer
2k
views
Linear model: potential outcome framework vs. structural causal model
From my reading about the potential outcomes framework (POF) and structural causal models (SCM), I understand that both perspectives have been shown to be equivalent but take different starting points....
7
votes
0
answers
182
views
Identifying non-linearities in relationship between variables
Logistic regression is often used to identify the effect of $x$ on a binary variable $y$ after adjusting for potential confounders $x_1,...,x_n$. In the medical literature, I will sometimes encounter ...
7
votes
0
answers
2k
views
What are the differences between HC estimators and their small sample properties?
I am currently using R to run regression with the following code:
...
7
votes
1
answer
2k
views
What is the equivalent of a standard deviation when considering a least squares fit line?
I am calculating a tolerance interval following http://www.itl.nist.gov/div898/handbook/prc/section2/prc253.htm but this says to multiply the k value by the standard deviation of the sample. I have a ...
6
votes
0
answers
85
views
Asymptotic properties of functional models
When working in Functional Data Analysis, a classical "preprocessing" step is to represent the "observations" using a B-spline expansion:
$$
X_i(t) \approx \sum_{j=1}^J \lambda_{ij}...
6
votes
0
answers
78
views
Before using CV-selected Regression model for Inference, shouldn't model performance be evaluated on unused test set?
I just came across a biokinesiology paper that used some Machine Learning methods, but I think there is a flaw in their methodology.
The authors had data on stroke patients and used Lasso regression ...
6
votes
0
answers
150
views
using `lmer` to fit the linear mixed effects models
Edit: I know some people vote this question is off-topic since it is more like a Cross Validated question. However, I am not here to ask about the coding thing (but I might word in the wrong way). I ...
6
votes
1
answer
113
views
Importance of regressors in time series data
Could anyone recommend bibliography or name some useful methods to analyze which (exogenous) variables are most important in determining the value of a time series?
For context, I have a random time ...
6
votes
1
answer
3k
views
Comparing observed and predicted values across several measurements
I am investigating whether a medical treatment has an effect on cognitive measures (aside from curing the medical problem). I have been looking for a statistically sound method to approach the ...
5
votes
1
answer
496
views
How to train Prediction model for longitudinal data, with large number of time points?
Given a longitudinal data, that has date (in month-year format) as one of the independent variables and other independent variables being Gross metric tonnes, Tensile strength(UTS), weight per unit ...
5
votes
0
answers
358
views
Equivalence of ANCOVA and repeated measures model
Consider an RCT with individuals i in 2 arms (group, with 0 = control and 1 = treatment) in which one metric outcome (score) is collected at baseline (pre) and after some treatment (post).
In an ...