CodeBuzz Helpers
Measures of cental tendency
- Mean: The mean, or average, is the sum of all values in a data set divided by the number of values.
The mean is sensitive to extreme values (outliers) and may not accurately represent the central location if the data is skewed.
- Median: The median is the middle value of a data set when it is ordered. If there is an even number of values, the median is the average of the two middle values.
The median is not affected by extreme values and is a robust measure of central tendency.
- Mode: The mode is the value that appears most frequently in a data set.
A data set may have one mode, more than one mode, or no mode at all.
Unlike the mean and median, the mode can be used for both numerical and categorical data.
A data set with one mode is unimodal, while a data set with two modes is bimodal, and more than two modes is multimodal.
- .describe(): The .describe() method is a convenient function in the Pandas library in Python that provides a quick statistical summary of a DataFrame.
When applied to a Pandas DataFrame or Series, it returns several descriptive statistics for each column,
including measures of central tendency, dispersion, and shape of the distribution.
- MIN/MAX In Pandas, the min() and max() functions are used to find the minimum and maximum values, respectively, in a Pandas DataFrame or Series.