Highly scored unanswered questions - Page 5

10 votes

0 answers

1k views

What techniques are there to measure goodness of fit of Deming (orthogonal) regression?

Questions: Even if there is no "widely accepted" technique, is there a useful-and-above-average technique for estimating goodness of fit in orthogonal regressions? What are the pros/cons of this ...

Jorge

21

modified Nov 9, 2023 at 22:04

10 votes

0 answers

744 views

Errors-in-Variables model for logistic regression

Simple question: I am familiar (though don't have tons of experience) with errors-in-variables regression. From what I have seen, this mostly is used with continuous outcomes in a linear model. A) Is ...

Shawn Hemelstrand

20.4k

modified Apr 22, 2024 at 23:28

10 votes

0 answers

2k views

"weight" input in glm.nb function in R. How exactly does the weight affect the likelihood?

I would like to understand how the weight argument of glm.nb is affecting the likelihood function. I understand that glm.nb find ...

Benjamin Christoffersen

2,564

modified Jun 3, 2020 at 10:54

10 votes

0 answers

636 views

Propensity Score Matching – How do the mechanics lead to a different result than unmatched?

The gist of propensity score matching, as I understand it, is as follows: You want to estimate the average treatment effect (ATE) of a treatment on some outcome. However, if you simply calculate the ...

Krantz

585

modified Sep 3, 2019 at 11:24

10 votes

0 answers

160 views

Rationale behind Good–Turing frequency estimation?

Good–Turing frequency estimation is a smoothing estimator for estimating a multinomial distribution. It seems very convoluted. From mathematical statistics point of view, what is the rationale behind ...

kjetil b halvorsen♦

84k

modified May 26, 2017 at 19:08

10 votes

2 answers

7k views

Difference-in-difference in panel data

Under which conditions should we expect the difference-in-difference estimate to be equal to the equivalent panel data model? Strictly speaking, whenever we have a experiment that offers a well ...

CommunityBot

1

modified Apr 28 at 13:07

10 votes

0 answers

2k views

Very different scale parameter estimates in Poisson regression

The background: I'm analysing survival data using a Poisson model. I've splitted the data on 2 time-scales (attained age and calendar year). Attained age is modelled using flexible parametric ...

CommunityBot

1

modified Aug 19, 2013 at 20:25

10 votes

0 answers

387 views

"Brute force" expected deviance for logistic regression?

A commonly used goodness of fit statistic for logistic regression is the deviance. This is also known as the likelihood ratio chi-square statistic. It is defined as: $$D=\sum_{i=1}^{N}d_i^2$$ $$d_i^...

probabilityislogic

25.5k

Nov 24, 2011 at 12:25

10 votes

2 answers

2k views

Random Forest: Class specific feature importance

I'm using the bigrf R-package to analyse a dataset with ca. 50.000 observations x 120 variables, classified into two groups. After growing a forest of 1000 trees, ...

CommunityBot

1

modified Feb 8 at 22:07

10 votes

1 answer

5k views

Calculating standard deviation from log-normal distribution confidence intervals

I have the results of a meta-analysis of 10 studies that reports a combined random effects odds ratio (computed using Woolf's method) and 95% confidence interval of an event happening in one group ...

CommunityBot

1

modified Nov 10, 2024 at 10:01

9 votes

0 answers

192 views

CDC says median number of sexual partners a woman has is 4.3. How can this be?

https://www.cdc.gov/nchs/nsfg/key_statistics/n-keystat.htm According to the CDC, the median of the number of sexual partners a woman has is 4.3. But how can this be? The median can be 4, or 5, or ...

User1865345

11.5k

modified Jan 14 at 15:33

9 votes

0 answers

116 views

Why don't we typically worry about stationarity in panel data models with fixed effects?

Why don't we typically worry about stationarity in panel data models with fixed effects? In time series analysis, stationarity is often a crucial assumption. However, I've noticed that in applied ...

CommunityBot

1

modified Dec 15, 2024 at 5:14

9 votes

0 answers

109 views

Name for Generalized Generalized Linear Models

Consider the class of models given by $y\sim F(g^{-1}(\beta^\top\mathbf{x}))$ with $\mathbb{E}[Y]=g^{-1}(\beta^\top\mathbf{x})$. Most authors I've come across call this a GLM only if F is in the ...

User1865345

11.5k

modified Oct 26, 2024 at 15:17

9 votes

0 answers

137 views

Intuitively, why do flushes overtake straights as the number of cards in hand increases?

A 5-card poker hand is more likely to be a straight than a flush. But a 13-card bridge hand is more likely to contain a 5-card flush than a 5-card straight (source: I read it online somewhere). What ...

fblundun

4,199

modified Aug 23, 2024 at 10:51

9 votes

0 answers

132 views

Any Insights on the adoption and use of the Healthy Akaike Information Criterion (hAIC)?

Recently, I came across the Healthy Akaike Information Criterion (hAIC), introduced by Demidenko in his 2004 book "Mixed Models: Theory and Applications with R." Despite its (potential) ...

Neal Fultz

568

modified Aug 13, 2024 at 17:41

Stack Exchange Network

Unanswered Questions

What techniques are there to measure goodness of fit of Deming (orthogonal) regression?

Errors-in-Variables model for logistic regression

"weight" input in glm.nb function in R. How exactly does the weight affect the likelihood?

Propensity Score Matching – How do the mechanics lead to a different result than unmatched?

Rationale behind Good–Turing frequency estimation?

Difference-in-difference in panel data

Very different scale parameter estimates in Poisson regression

"Brute force" expected deviance for logistic regression?

Random Forest: Class specific feature importance

Calculating standard deviation from log-normal distribution confidence intervals

CDC says median number of sexual partners a woman has is 4.3. How can this be?

Why don't we typically worry about stationarity in panel data models with fixed effects?

Name for Generalized Generalized Linear Models

Intuitively, why do flushes overtake straights as the number of cards in hand increases?

Any Insights on the adoption and use of the Healthy Akaike Information Criterion (hAIC)?

Unanswered Questions

Unanswered Tags