Tags
A tag is a keyword or label that categorizes your question with other, similar questions. Using the right tags makes it easier for others to find and answer your question.
Techniques for analyzing the relationship between one (or more) "dependent" variables and "independent" variables.
30643 questions
for any *on-topic* question that (a) involves `R` either as a critical part of the question or expected answer, & (b) is not *just* about how to use `R`.
30416 questions
Machine learning algorithms build a model of the training data. The term "machine learning" is vaguely defined; it includes what is also called statistical learning, reinforcement learning, unsupervis…
20328 questions
Time series are data observed over time (either in continuous time or at discrete time periods).
14533 questions
A probability provides a quantitative description of the likely occurrence of a particular event.
12876 questions
Hypothesis testing assesses whether data are inconsistent with a given hypothesis (usually a null hypothesis of no effect).
11084 questions
A distribution is a mathematical description of probabilities or frequencies.
9739 questions
A routine exercise designed to test one's knowledge; often from a textbook, course, or test used for a class or self-study. This community's policy is to "provide helpful hints" for such questions rat…
8193 questions
Bayesian inference is a method of statistical inference that relies on treating the model parameters as random variables and applying Bayes' theorem to deduce subjective probability statements about t…
8169 questions
Artificial neural networks (ANNs) are a broad class of computational models loosely based on biological neural networks. They encompass feedforward NNs (including "deep" NNs), convolutional NNs, recu…
8133 questions
Refers generally to statistical procedures that utilize the logistic function, most commonly various forms of logistic regression
8057 questions
Mathematical theory of statistics, concerned with formal definitions and general results.
7940 questions
Statistical classification is the problem of identifying the sub-population to which new observations belong, where the identity of the sub-population is unknown, on the basis of a training set of dat…
6949 questions
Mixed (aka multilevel or hierarchical) models are linear models that include both fixed effects and random effects. They are used to model longitudinal or nested data.
6684 questions
"Statistical significance" is a term used in classical hypothesis testing. We say that the evidence in the test is "statistical significant" if it is sufficiently in favour of the alternative hypothe…
6531 questions
A measure of the degree of association among a pair of variables.
6471 questions
The normal, or Gaussian, distribution has a density function that is a symmetrical bell-shaped curve. It is one of the most important distributions in statistics. Use the [normality-assumption] tag fo…
6213 questions
Regression that includes two or more non-constant independent variables.
5692 questions
ANOVA stands for ANalysis Of VAriance, a statistical model and set of procedures for comparing multiple group means. The independent variables in an ANOVA model are categorical, but an ANOVA table can…
5405 questions
Python is a programming language commonly used for machine learning. Use this tag for any *on-topic* question that (a) involves `Python` either as a critical part of the question or expected answer, &…
4846 questions
A generalization of linear regression allowing for nonlinear relationships via a "link function" and for the variance of the response to depend on the predicted value. (Not to be confused with "genera…
4738 questions
A confidence interval is an interval that covers an unknown parameter with $100(1-\alpha)\%$ confidence. Confidence intervals are a frequentist concept. They are often confused with credible intervals…
4725 questions
The expected squared deviation of a random variable from its mean; or, the average squared deviation of data about their mean.
4334 questions
Cluster analysis is the task of partitioning data into subsets of objects according to their mutual "similarity," without using preexisting knowledge such as class labels. [Clustered-standard-errors a…
4046 questions
Prediction of the future events. It is a special case of [prediction], in the context of [time-series].
3948 questions
A test for comparing the means of two samples, or the mean of one sample (or even parameter estimates) with a specified value; also known as the "Student t-test" after the pseudonym of its inventor.
3732 questions
Categorical (also called nominal) data can take on a limited number of possible values called categories. Categorical values "label", they do not "measure". Please use [ordinal-data] tag for discrete …
3629 questions
lme4 and nlme are R packages used for fitting linear, generalized linear and nonlinear mixed effects models. For general questions about mixed models use [mixed-model] tag.
3566 questions
Repeatedly withholding subsets of the data during model fitting in order to quantify the model performance on the withheld data subsets.
3523 questions
Survival analysis models time to event data, typically time to death or failure time. Censored data are a common problem for survival analyses.
3505 questions
a method of estimating parameters of a statistical model by choosing the parameter value that optimizes the probability of observing the given sample.
3480 questions
Principal component analysis (PCA) is a linear dimensionality reduction technique. It reduces a multivariate dataset to a smaller set of constructed variables preserving as much information (as much v…
3464 questions
Creating samples from a well-specified population using a probabilistic method and/or producing random numbers from a specified distribution. As this tag is ambiguous, please consider [survey-sampling…
3334 questions
too general; please provide a more specific tag. For questions about the properties of specific estimators, use [estimators] tag instead.
3316 questions
Predictive models are statistical models whose primary purpose is to predict other observations of a system optimally, as opposed to models whose purpose is to test a particular hypothesis or explain …
3234 questions
Drawing conclusions about population parameters from sample data. See https://en.wikipedia.org/wiki/Inference and https://en.wikipedia.org/wiki/Statistical_inference
3155 questions