Skip to main content

Unanswered Questions

2,800 questions with no upvoted or accepted answers
14 votes
0 answers
700 views

Convolutional neural network for multi-variate time series?

I want to use CNN architectures for classification of multivariate time-series, where we apply one label to each sequence. I searched the net for the available designs in the literature and i found ...
13 votes
0 answers
272 views

Logistic regression for classification: are there any analytical solutions for the out-of-sample accuracy?

I run a binary logistic regression, with a binary dependent variable and a continuous independent one. Now I want to evaluate the out-of-sample performance of the classification algorithm so obtained. ...
12 votes
0 answers
2k views

Computing a bootstrap confidence interval for the prediction error with the percentile and the BCa method

I have two related questions regarding the computation of a non-parametric bootstrap confidence interval for the prediction error. Setting: I have a sample S from a data population P and a learner L, ...
10 votes
2 answers
2k views

Random Forest: Class specific feature importance

I'm using the bigrf R-package to analyse a dataset with ca. 50.000 observations x 120 variables, classified into two groups. After growing a forest of 1000 trees, ...
9 votes
1 answer
179 views

Is there a ML or DL tool that can learn to detect periodically occurring patterns in a one dimensional time series?

I am trying to create a tool that labels refrigerator temperature readings. A reading is taken every 5 minutes, and its label identifies whether of not it was taken while the refrigerator was ...
9 votes
0 answers
995 views

In penalized regression models, should binary predictors be standardized?

It is generally agreed that in penalized regression models, such as ridge regression, the lasso, and the elastic net, one should standardize the predictors (such as dividing each by its SD) so that ...
9 votes
0 answers
6k views

Poisson regression for binary data

I've been trying to read up on Poisson regression models, and it looks like it is possible to estimate such a model with a binary outcome. This has come up before on this site here (and somewhat here ...
8 votes
1 answer
809 views

How to predict routes using clustering data

I've been working on a ship route prediction algorithm such that given the past and current trajectory of a ship I am able to estimate the future one. The trajectories are represented as a sequence of ...
8 votes
0 answers
209 views

When is there a free lunch?

The no free lunch theorem (NFL) states that Theorem (Wolpert and Macready 1997) Let $A$ be any learning algorithm for the task of binary classification with respect to the $0−1$ loss over a ...
8 votes
0 answers
14k views

True positive, false negative, true negative, false positive definitions for multiclass-multilabel classification?

I'm trying to apply some evaluation metrics to several clustering methods. I thought that I knew them basing on the multiclass confusion matrix, considering the rows as the actual classes and the ...
8 votes
1 answer
8k views

Micro vs weighted F1 score

In a multi-label or multi-class classification setting, when choosing between a micro or a weighted F1 score, what shall I take into account? The main upside of choosing macro is that one gets a ...
8 votes
0 answers
1k views

Selecting link function in GEE with binary dependent variable

In my experiment participants had to make a binary (yes-no) decision about various stimuli. I have two categorical (stimulus characteristics coded as -1 0 and 1 and treatment group coded as 0 1) and ...
8 votes
1 answer
832 views

How to subset alternatives in nested multinomial logistic regression?

I am trying to predict whether or not captains in a particular groundfish fishery choose to fish on any given day and what variables may influence that decision. Originally I had planned on using ...
7 votes
0 answers
349 views

Features for binary time-series event prediction

This question is somewhat inspired by the answer to Features for time series classification. The difference to that question is that I have a dataset with multi-dimensional time-series where I have ...
7 votes
0 answers
183 views

How to combine noisy and noise-free datasets to train a model

Overview Suppose I have two datasets, both of which consist of rows of features and their matching labels. One of these datasets is noise-free and its labels correspond to the ground truth, but the ...

15 30 50 per page
1
2 3 4 5
187