Skip to main content

Unanswered Questions

71,464 questions with no upvoted or accepted answers
12 votes
0 answers
1k views

Why we really need the concept of "Local" Rademacher complexity?

Recently, I have been studying High-Dimensional Statistics: A Non-Asymptotic Viewpoint written by Martin J. Wainwright. In this book, the author uses a special complexity measure which is called Local ...
12 votes
0 answers
3k views

Fourier transform of a Gaussian process

I would like to discuss and ask a question regarding the Fourier transform of a Gaussian process, if it makes sense. For that purpose, let me describe the following situation. Let $z(s)$ be a ...
12 votes
2 answers
285 views

Methods to detect published mistakes without raw data?

I'm interested in ways to detect mistakes in published papers without analyzing the raw data. For example the GRIM test [1]. Here's another similarish one from one of the GRIM authors' blog. I don't ...
12 votes
0 answers
2k views

Is sparsity of topics a necessary condition for latent Dirichlet allocation (LDA) to work

I have been playing with the hyper-parameters of the latent Dirichlet allocation (LDA) model and am wondering how sparsity of topic priors play a role in inference. I have not performed these ...
12 votes
0 answers
788 views

What approaches use multiple eigenvectors in graph spectral clustering?

Background: In Newman's PNAS 2006 paper Modularity and community structure in networks, the first eigenvector splits the graph in two clusters, and then each cluster can be further divided by ...
12 votes
0 answers
2k views

Empirical Prediction interval for time series forecast based on quantile regression

As Gardner notes "almost all point forecasts are wrong", so prediction intervals (PI) are necessary to quantify uncertainty and help us make informed decisions. There exists theoretical PI, and in ...
12 votes
0 answers
130 views

Words that estimate numerical proportions of a group

There seem to be more than a few papers that surround the use of certain words to define the probability of an event without actually using the numerical probability itself. For example, if you are ...
12 votes
0 answers
2k views

Computing a bootstrap confidence interval for the prediction error with the percentile and the BCa method

I have two related questions regarding the computation of a non-parametric bootstrap confidence interval for the prediction error. Setting: I have a sample S from a data population P and a learner L, ...
11 votes
1 answer
521 views

Queuing theory for elevators

It's been a while since I had my probability course based on Sheldon Ross' book "Probability Models", and while I never went into econometrics, I was very interested in the queuing theory section. I ...
11 votes
0 answers
192 views

Pope effect on pizza - Regression with presence absence and similarity data as dependent variables

I'm trying to figure out the right way to set up a regression when the dependent variables are presence absence data (of pizzas), and the similarity between the present pizzas. Bear with the story: ...
11 votes
0 answers
1k views

Bootstrap Prediction Interval: which residuals to use and which method?

I ask this question referring to the post: Bootstrap prediction interval, where a step by step method for calculating the prediction interval for linear regression models is explained. In the ...
11 votes
1 answer
922 views

Proof that the addition of a baseline to the REINFORCE algorithm reduces the variance

A widely used variation of REINFORCE is to subtract a baseline value $b$ from the return $G_t$ to reduce the variance of gradient estimation, such that \begin{align} \nabla_\theta J(\theta) & \...
11 votes
3 answers
2k views

Need advice on change point (step) detection

I have a time series with lots of steps/jumps (data file here). A plot is given below. I would like to subtract an appropriate value for each of these square wave features to bring them back down to ...
11 votes
0 answers
3k views

Expected value of softmax transformation of Gaussian random vector

Let $\mathbf w_1,\mathbf w_2,\ldots,\mathbf w_n \in \mathbb R^p$ and $\mathbf v \in \mathbb R^n$ be fixed vectors, and $\mathbf x \sim \mathcal N_p(\boldsymbol{\mu}, \mathbf{\Sigma})$ be an $p$-...
11 votes
1 answer
6k views

Generalized additive model: choosing between cubic and thin-plate splines

I am using the gam function (from the mgcv package) to model a continuous response (a soil nutrient) in relation to a continuous ...

15 30 50 per page