Tags - Cross Validated

data-visualization

Constructing and interpreting meaningful and useful graphical representations of data. (If your question is only about how to get particular software to produce a specific effect, then it is likely no…

3127 questions

71 asked this year

p-value

In frequentist hypothesis testing, the $p$-value is the probability of a result at least as extreme as the observed result, under the assumption that the null hypothesis is true.

2990 questions

7 asked this month, 107 this year
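
As a rough illustration of the $p$-value definition above, the sketch below computes a one-sided exact binomial $p$-value. The function name `binomial_upper_p_value` and the coin-toss numbers are hypothetical and chosen only for this example; "as extreme or more" is taken here to mean "at least as many successes", which is one common convention and not the only one.

```python
from fractions import Fraction
from math import comb


def binomial_upper_p_value(successes, trials, p_null=Fraction(1, 2)):
    """P(X >= successes) for X ~ Binomial(trials, p_null), computed exactly."""
    return sum(
        comb(trials, k) * p_null**k * (1 - p_null) ** (trials - k)
        for k in range(successes, trials + 1)
    )


# Hypothetical data: 9 heads in 10 tosses, null hypothesis "the coin is fair".
p_value = binomial_upper_p_value(9, 10)
print(p_value)  # 11/1024
```

Exact fractions are used so that the result is not blurred by floating-point rounding; a two-sided version would also need to count outcomes that are equally extreme in the other direction.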

arima

Refers to the AutoRegressive Integrated Moving Average model used in time series modeling both for data description and for forecasting. This model generalizes the ARMA model by including a term for d…

2979 questions

5 asked this month, 79 this year

least-squares

Refers to a general estimation technique that selects the parameter value to minimize the squared difference between two quantities, such as the observed value of a variable, and the expected value of…

2919 questions

5 asked this month, 106 this year

mean

The expected value of a random variable; or a location measure for a sample.

2848 questions

60 asked this year

optimization

Use this tag for any use of optimization within statistics.

2826 questions

5 asked this month, 97 this year

repeated-measures

Repeated measures data occur when more than one measurement is collected on the same units (e.g. subjects). Use this tag for RM-ANOVA together with the [anova] tag.

2806 questions

7 asked this month, 105 this year

interaction

A situation where the effect of an explanatory variable may depend on the value of another explanatory variable.

2780 questions

154 asked this year

references

Questions seeking external references (books, papers, etc.) about a particular subject. Always use a more specific tag in addition.

2716 questions

11 asked this month, 112 this year

chi-squared-test

A test (typically of distribution, independence, or goodness of fit) for analyzing contingency tables of counts.

2686 questions

73 asked this year

econometrics

Econometrics is a field of statistics dealing with applications to economics.

2639 questions

5 asked this month, 122 this year

linear-model

Refers to any model where a random variable is related to one or more random variables by a function that is linear in a finite number of parameters.

2611 questions

5 asked this month, 87 this year

modeling

Describes the process of creating a statistical or machine learning model. Always add a more specific tag.

2603 questions

63 asked this year

multivariate-analysis

Analyses/models where there is more than one response (dependent) variable. Commonly confused with "multiple" or "multivariable" analysis, which has more than one predictor (independent) variable.

2560 questions

7 asked this month, 80 this year

random-variable

A random variable or stochastic variable is a value that is subject to chance variation (i.e., randomness in a mathematical sense).

2535 questions

66 asked this year

conditional-probability

The probability that an event A will occur, when another event B is known to occur or to have occurred. It is commonly denoted by P(A|B).

2525 questions

85 asked this year
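
A minimal sketch of the conditional-probability definition above, assuming a finite sample space with equally likely outcomes so that P(A|B) reduces to counting. The helper `conditional_probability` and the die example are hypothetical.

```python
from fractions import Fraction


def conditional_probability(outcomes, event_a, event_b):
    """P(A|B) = P(A and B) / P(B), for equally likely outcomes."""
    in_b = [o for o in outcomes if event_b(o)]
    if not in_b:
        raise ValueError("P(B) is zero, so P(A|B) is undefined")
    in_a_and_b = [o for o in in_b if event_a(o)]
    return Fraction(len(in_a_and_b), len(in_b))


# Hypothetical example: one roll of a fair die.
# A = "the roll is even", B = "the roll is greater than 3".
die = range(1, 7)
print(conditional_probability(die, lambda o: o % 2 == 0, lambda o: o > 3))  # 2/3
```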

data-transformation

Mathematical re-expression, often nonlinear, of data values. Data are often transformed either to meet the assumptions of a statistical model or to make the results of an analysis more interpretable.

2499 questions

5 asked this month, 75 this year

panel-data

Panel data refers to multi-dimensional data frequently involving measurements over time in econometrics. It is also called longitudinal data in biostatistics, psychology, and some other fields.

2490 questions

9 asked this month, 166 this year

random-forest

Random forest is a machine-learning method based on combining the outputs of many decision trees.

2485 questions

44 asked this year

interpretation

Refers generally to making substantive conclusions from the results of a statistical analysis.

2483 questions

8 asked this month, 68 this year

feature-selection

Methods and principles of selecting a subset of attributes for use in further modelling.

2455 questions

71 asked this year

binomial-distribution

The binomial distribution gives the frequencies of "successes" in a fixed number of independent "trials". Use this tag for questions about data that might be binomially distributed or for questions a…

2448 questions

6 asked this month, 80 this year

expected-value

The expected value of a random variable is a weighted average of all possible values a random variable can take on, with the weights equal to the probability of taking on that value.

2375 questions

5 asked this month, 69 this year
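
The weighted-average description above can be sketched for a discrete random variable as follows. The function `expected_value` and the die example are hypothetical; the probability mass function is assumed to be given as a mapping from values to probabilities.

```python
from fractions import Fraction


def expected_value(pmf):
    """Weighted average of the possible values, weighted by their probabilities."""
    if sum(pmf.values()) != 1:
        raise ValueError("probabilities must sum to 1")
    return sum(value * prob for value, prob in pmf.items())


# Hypothetical example: a fair six-sided die.
die = {face: Fraction(1, 6) for face in range(1, 7)}
print(expected_value(die))  # 7/2
```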

svm

Support Vector Machine refers to "a set of related supervised learning methods that analyze data and recognize patterns, used for classification and regression analysis."

2293 questions

31 asked this year

regression-coefficients

The parameters of a regression model. Most commonly, the values by which the independent variables will be multiplied to get the predicted value of the dependent variable.

2146 questions

7 asked this month, 84 this year

nonparametric

Use this tag to ask about the nature of nonparametric or parametric methods, or the difference between the two. Nonparametric methods generally rely on few assumptions about the underlying distributio…

2145 questions

64 asked this year

standard-deviation

Standard deviation is the square root of the variance of a random variable, an estimator thereof, or a similar measure of the spread of a batch of data.

2051 questions

48 asked this year

spss

IBM SPSS Statistics is a statistical software package. Use this tag for any on-topic question that (a) involves SPSS either as a critical part of the question or expected answer and (b) is not just ab…

2040 questions

41 asked this year

bootstrap

The bootstrap is a resampling method to estimate the sampling distribution of a statistic.

2032 questions

89 asked this year
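
A minimal sketch of the resampling idea above, using only the Python standard library. It implements the simple nonparametric bootstrap (resampling the data with replacement); the function name `bootstrap_statistic`, the sample values, and the number of resamples are assumptions made for illustration.

```python
import random
import statistics


def bootstrap_statistic(data, stat=statistics.mean, n_resamples=2000, seed=0):
    """Return the statistic recomputed on resamples drawn with replacement."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    n = len(data)
    return [stat(rng.choices(data, k=n)) for _ in range(n_resamples)]


# Made-up sample, purely for illustration.
sample = [2.1, 3.4, 1.9, 5.0, 4.2, 3.3, 2.8, 4.7]
replicates = bootstrap_statistic(sample)

# The spread of the replicates approximates the standard error of the mean.
standard_error = statistics.stdev(replicates)
print(standard_error)
```

The list of replicates is an empirical approximation to the sampling distribution of the statistic; percentiles of that list give one simple kind of bootstrap confidence interval.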

model-selection

Model selection is a problem of judging which model from some set performs best. Popular methods include $R^2$, AIC and BIC criteria, test sets, and cross-validation. To some extent, feature selection…

2029 questions

56 asked this year

experiment-design

The study of how to structure an information-gathering exercise where variation is present.

2024 questions

7 asked this month, 85 this year

sample-size

This tag is ambiguous. Use it when the question is about sample size and NONE of the following are more appropriate: [small-sample], [large-data], [statistical-power], [underdetermined], or [unbalance…

2013 questions

6 asked this month, 100 this year

independence

Events (or random variables) are independent when information on some of them tells you nothing about the probability of occurrence (or the distribution) of the others. Please DO NOT use this tag for indep…

1978 questions

64 asked this year

simulation

A vast area which includes generating results from computer models.

1972 questions

88 asked this year

multilevel-analysis

Statistical analysis of datasets comprising several levels of hierarchy (e.g., students nested in classes nested in schools or hierarchical forecasting). For questions about mixed models use [mixed-mo…

1937 questions

98 asked this year

poisson-distribution

A discrete distribution defined on the non-negative integers that has the property that the mean is equal to the variance.

1934 questions

7 asked this month, 50 this year
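
The mean-equals-variance property above can be checked numerically. The sketch below sums the Poisson probability mass function over a truncated range; the rate `3.5` and the cutoff of 200 terms are arbitrary assumptions, and the truncation makes this an approximation, not a proof.

```python
import math


def poisson_mean_and_variance(rate, max_k=200):
    """Approximate the mean and variance by summing the pmf up to max_k."""
    prob = math.exp(-rate)  # P(X = 0)
    mean = 0.0
    second_moment = 0.0
    for k in range(1, max_k + 1):
        prob *= rate / k  # P(X = k) from P(X = k - 1)
        mean += k * prob
        second_moment += k * k * prob
    return mean, second_moment - mean**2


mean, variance = poisson_mean_and_variance(3.5)
print(mean, variance)  # both close to the rate, 3.5
```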