Unanswered Questions
5,031 questions with no upvoted or accepted answers
14 votes, 0 answers, 700 views
Convolutional neural network for multi-variate time series?
I want to use CNN architectures for classification of multivariate time-series, where we apply one label to each sequence.
I searched the net for the available designs in the literature and I found ...
13 votes, 0 answers, 272 views
Logistic regression for classification: are there any analytical solutions for the out-of-sample accuracy?
I run a binary logistic regression, with a binary dependent variable and a continuous independent one.
Now I want to evaluate the out-of-sample performance of the classification algorithm so obtained. ...
12 votes, 0 answers, 788 views
What approaches use multiple eigenvectors in graph spectral clustering?
Background: In Newman's PNAS 2006 paper "Modularity and community structure in networks", the first eigenvector splits the graph into two clusters, and then each cluster can be further divided by ...
12 votes, 0 answers, 2k views
Computing a bootstrap confidence interval for the prediction error with the percentile and the BCa method
I have two related questions regarding the computation of a non-parametric bootstrap confidence interval for the prediction error.
Setting: I have a sample S from a data population P and a learner L, ...
10 votes, 0 answers, 3k views
Cluster analysis vs Factor analysis as a means for "grouping" variables or cases
I've noticed responses that at face value seem to be in contradiction with each other.
For instance, here @peter-flom writes
Short answer: Cluster analysis is about grouping subjects (e.g.
...
10 votes, 2 answers, 2k views
Random Forest: Class specific feature importance
I'm using the bigrf R package to analyse a dataset with ca. 50,000 observations × 120 variables, classified into two groups.
After growing a forest of 1000 trees, ...
9 votes, 1 answer, 179 views
Is there a ML or DL tool that can learn to detect periodically occurring patterns in a one dimensional time series?
I am trying to create a tool that labels refrigerator temperature readings. A reading is taken every 5 minutes, and its label identifies whether or not it was taken while the refrigerator was ...
9 votes, 0 answers, 182 views
Territories from observations
I have a number of animal observations, and want to deduce the number of territories (i.e. the number of individual animals) from this.
More formally, the problem can be stated as follows: Each ...
8 votes, 1 answer, 809 views
How to predict routes using clustering data
I've been working on a ship route prediction algorithm such that given the past and current trajectory of a ship I am able to estimate the future one. The trajectories are represented as a sequence of ...
8 votes, 0 answers, 209 views
When is there a free lunch?
The no free lunch theorem (NFL) states that
Theorem (Wolpert and Macready 1997) Let $A$ be any learning algorithm for the task of binary classification with respect to the $0-1$ loss over a ...
8 votes, 0 answers, 2k views
What is the intuition for testing seasonal difference with OCSB test and its correct application?
I have daily time series data of a shop's revenue. Now I would like to test for seasonal differencing with the OCSB test, originally introduced in (Osborn et al. (1988): Seasonality and the Order of ...
8 votes, 0 answers, 14k views
True positive, false negative, true negative, false positive definitions for multiclass-multilabel classification?
I'm trying to apply some evaluation metrics to several clustering methods. I thought that I knew them based on the multiclass confusion matrix, considering the rows as the actual classes and the ...
8 votes, 1 answer, 8k views
Micro vs weighted F1 score
In a multi-label or multi-class classification setting, when choosing between a micro or a weighted F1 score, what shall I take into account?
The main upside of choosing macro is that one gets a ...
8 votes, 1 answer, 832 views
How to subset alternatives in nested multinomial logistic regression?
I am trying to predict whether or not captains in a particular groundfish fishery choose to fish on any given day and what variables may influence that decision. Originally I had planned on using ...
7 votes, 0 answers, 349 views
Features for binary time-series event prediction
This question is somewhat inspired by the answer to Features for time series classification.
The difference from that question is that I have a dataset with multi-dimensional time-series where I have ...