Unanswered Questions
71,464 questions with no upvoted or accepted answers
10
votes
0
answers
1k
views
What techniques are there to measure goodness of fit of Deming (orthogonal) regression?
Questions:
Even if there is no "widely accepted" technique, is there a useful-and-above-average technique for estimating goodness of fit in orthogonal regressions?
What are the pros/cons of this ...
10
votes
0
answers
744
views
Errors-in-Variables model for logistic regression
Simple question: I am familiar (though don't have tons of experience) with errors-in-variables regression. From what I have seen, this mostly is used with continuous outcomes in a linear model.
A) Is ...
10
votes
0
answers
2k
views
"weight" input in glm.nb function in R. How exactly does the weight affect the likelihood?
I would like to understand how the weight argument of glm.nb is affecting the likelihood function.
I understand that glm.nb find ...
10
votes
0
answers
636
views
Propensity Score Matching – How do the mechanics lead to a different result than unmatched?
The gist of propensity score matching, as I understand it, is as follows:
You want to estimate the average treatment effect (ATE) of a treatment on some outcome. However, if you simply calculate the ...
10
votes
0
answers
160
views
Rationale behind Good–Turing frequency estimation?
Good–Turing frequency estimation is a smoothing estimator for estimating a multinomial distribution. It seems very convoluted.
From mathematical statistics point of view, what is the rationale
behind ...
10
votes
2
answers
7k
views
Difference-in-difference in panel data
Under which conditions should we expect the difference-in-difference estimate to be equal to the equivalent panel data model?
Strictly speaking, whenever we have a experiment that offers a well ...
10
votes
0
answers
2k
views
Very different scale parameter estimates in Poisson regression
The background: I'm analysing survival data using a Poisson model. I've splitted the data on 2 time-scales (attained age and calendar year). Attained age is modelled using flexible parametric ...
10
votes
0
answers
387
views
"Brute force" expected deviance for logistic regression?
A commonly used goodness of fit statistic for logistic regression is the deviance. This is also known as the likelihood ratio chi-square statistic. It is defined as:
$$D=\sum_{i=1}^{N}d_i^2$$
$$d_i^...
10
votes
2
answers
2k
views
Random Forest: Class specific feature importance
I'm using the bigrf R-package to analyse a dataset with ca. 50.000 observations x 120 variables, classified into two groups.
After growing a forest of 1000 trees, ...
10
votes
1
answer
5k
views
Calculating standard deviation from log-normal distribution confidence intervals
I have the results of a meta-analysis of 10 studies that reports a combined random effects odds ratio (computed using Woolf's method) and 95% confidence interval of an event happening in one group ...
9
votes
0
answers
192
views
CDC says median number of sexual partners a woman has is 4.3. How can this be?
https://www.cdc.gov/nchs/nsfg/key_statistics/n-keystat.htm
According to the CDC, the median of the number of sexual partners a woman has is 4.3. But how can this be? The median can be 4, or 5, or ...
9
votes
0
answers
116
views
Why don't we typically worry about stationarity in panel data models with fixed effects?
Why don't we typically worry about stationarity in panel data models with fixed effects?
In time series analysis, stationarity is often a crucial assumption. However, I've noticed that in applied ...
9
votes
0
answers
109
views
Name for Generalized Generalized Linear Models
Consider the class of models given by $y\sim F(g^{-1}(\beta^\top\mathbf{x}))$ with $\mathbb{E}[Y]=g^{-1}(\beta^\top\mathbf{x})$.
Most authors I've come across call this a GLM only if F is in the ...
9
votes
0
answers
137
views
Intuitively, why do flushes overtake straights as the number of cards in hand increases?
A 5-card poker hand is more likely to be a straight than a flush.
But a 13-card bridge hand is more likely to contain a 5-card flush than a 5-card straight (source: I read it online somewhere).
What ...
9
votes
0
answers
132
views
Any Insights on the adoption and use of the Healthy Akaike Information Criterion (hAIC)?
Recently, I came across the Healthy Akaike Information Criterion (hAIC), introduced by Demidenko in his 2004 book "Mixed Models: Theory and Applications with R." Despite its (potential) ...