Unanswered Questions
1,788 questions with no upvoted or accepted answers
10
votes
0
answers
377
views
Reinforcement *Model* Learning
Classical reinforcement learning (Q- or Sarsa-Learning) can be extended with models of the environment. These models are usually transition tables that contain the probability of arriving at a ...
9
votes
1
answer
229
views
What is Better for Prediction Error: Covariance Penalties or a Test Set?
I'm reading Computer Age Statistical Inference by Efron and Hastie, two statisticians I have a lot of respect for.
Section 12.3 discusses Mallows' $C_{p}$, Akaike's information criteria (AIC), and ...
9
votes
0
answers
173
views
In sports modelling, are hot simulations better or cold simulations?
I'm thinking here largely of the context in which someone has an Elo rating model for a particular sport.
To calculate things such as how often the team makes the Finals series, or wins the ...
8
votes
0
answers
820
views
Are there any General Proofs on Genetic Algorithms?
Are there any general proofs or theorems relating to "genetic algorithms"?
I have been reading about a theorem in math called the "Schema Theorem" - this theorem is one of the ...
8
votes
1
answer
809
views
How to predict routes using clustering data
I've been working on a ship route prediction algorithm such that given the past and current trajectory of a ship I am able to estimate the future one. The trajectories are represented as a sequence of ...
7
votes
0
answers
183
views
How to combine noisy and noise-free datasets to train a model
Overview
Suppose I have two datasets, both of which consist of rows of features and their matching labels. One of these datasets is noise-free and its labels correspond to the ground truth, but the ...
7
votes
0
answers
207
views
Including feature-dependent priors on output class, in bayesian logistic regression
When doing logistic regression with data $D_N = \{(x_i, y_i)\}_i^N$ with $x_i \in \mathbf{X}^N$ (each data point has N features) and $y_i \in \mathbf{Y}$ being assigned output classes, in a Bayesian ...
7
votes
0
answers
735
views
prediction interval of a new prediction out of a weighted linear regression model
This question is based on the example 11.1 out of the book Applied linear statistical models of Kutner, Nachtsheim, Neter and Li. You can find the data here.
First they calculate a simple linear ...
7
votes
0
answers
198
views
Graphical nominal model
Suppose I have a set of $k$ matrices.
$$
\mathbb D = A_1,A_2,...,A_k
$$
Each column of $A$ is categorical vector.
$$
A = v_1,v_2,...,v_n
$$
I want to find the mapping
$$
f: A \...
6
votes
0
answers
1k
views
Position Bias Normalization in CTR Prediction
I am working on Click Through Rate(CTR) prediction model on a toy dataset.
The label I am using is #Click/#NumShown.
But there is position bias in results shown. ...
6
votes
0
answers
574
views
logistic regression prediction: changing interpretation with changing prior
The data include 3 equally sized subsets A, B and C, belonging to two classes:
A belongs to class 1.
B and C belong to class 2.
The prior probabilities of an observation coming from class 1 and ...
6
votes
1
answer
3k
views
Comparing observed and predicted values across several measurements
I am investigating whether a medical treatment has an effect on cognitive measures (aside from curing the medical problem). I have been looking for a statistically sound method to approach the ...
5
votes
1
answer
201
views
How to estimate when an event of interest is overdue?
I'm looking for a principled way to estimate when an event of interest is overdue (a binary decision/alert), not just predicting when it is supposed to happen.
In the survival analysis literature I ...
5
votes
0
answers
1k
views
Prediction with OLS better then prediction with lasso or ridge
I did a regression on a train data set with 7000 observations and 50 explenatory variables with ols ridge and lasso. The lambda was chosen via cross validation.
After that i wanted to compare the ...
5
votes
0
answers
1k
views
Predictive modeling using GAM (mgcv)
I have seven years of continuous insect population data, along with temperature and humidity parameters. I’d like to use this data to predict future populations in a given year using a generalized ...