Package: DataSum 0.1.0
DataSum: The DataSumm function takes a data frame as input and applies the Datum function to each column, returning a data frame with the summary statistics for each column
The moments package provides functions for calculating various statistical moments and related measures, such as skewness and kurtosis. The dplyr package is used for data manipulation, and the nortest package is used for normality testing. The find_mode function takes a data vector as input and returns the mode(s) of the data. The shapiro_normality_test function performs a Shapiro-Wilk normality test on the input data, and returns "Normal" if the data is normally distributed (p-value > 0.05), and "Not Normal" otherwise. If the data length is outside the valid range for the Shapiro-Wilk test (3 to 5000), it performs an Anderson-Darling normality test instead. The Datum function takes a data vector as input and returns a data frame with various summary statistics, including data type, sample size, mean, mode, median, variance, standard deviation, maximum, minimum, range, skewness, kurtosis, and normality test result. If the data is numeric, it calculates the statistics accordingly. If the data is character or factor, it provides the mode and marks the other statistics as not applicable (NA).The DataSumm function takes a data frame as input and applies the Datum function to each column, returning a data frame with the summary statistics for each column. Measures of Central Tendency Mean: The average of the values, calculated by summing all the values and dividing by the number of values. Median: The middle value when the data is arranged in order. If there are an even number of values, the median is the average of the two middle values. Mode: The value that appears most frequently in the data set. Measures of Dispersion Range: The difference between the largest and smallest values in the data set. Variance: A measure of how spread out the values are from the mean, calculated as the average squared deviation from the mean. Standard Deviation: The square root of the variance, providing a measure of the average amount each value deviates from the mean. Other Measures Skewness: A measure of the asymmetry of the probability distribution of a random variable around its mean. Positive skewness indicates a distribution with an asymmetric tail extending towards more positive values. Kurtosis: A measure of the "peakedness" of the probability distribution of a random variable. Normality: A test to determine if the data follows a normal (Gaussian) distribution, such as the Shapiro-Wilk test.
Authors:
DataSum_0.1.0.tar.gz
DataSum_0.1.0.zip(r-4.5)DataSum_0.1.0.zip(r-4.4)DataSum_0.1.0.zip(r-4.3)
DataSum_0.1.0.tgz(r-4.4-any)DataSum_0.1.0.tgz(r-4.3-any)
DataSum_0.1.0.tar.gz(r-4.5-noble)DataSum_0.1.0.tar.gz(r-4.4-noble)
DataSum_0.1.0.tgz(r-4.4-emscripten)DataSum_0.1.0.tgz(r-4.3-emscripten)
DataSum.pdf |DataSum.html✨
DataSum/json (API)
# Install 'DataSum' in R: |
install.packages('DataSum', repos = c('https://uzairkhan11w.r-universe.dev', 'https://cloud.r-project.org')) |
Bug tracker:https://github.com/uzairkhan11w/datasum/issues
Last updated 4 months agofrom:153606be38. Checks:OK: 7. Indexed: yes.
Target | Result | Date |
---|---|---|
Doc / Vignettes | OK | Oct 28 2024 |
R-4.5-win | OK | Oct 28 2024 |
R-4.5-linux | OK | Oct 28 2024 |
R-4.4-win | OK | Oct 28 2024 |
R-4.4-mac | OK | Oct 28 2024 |
R-4.3-win | OK | Oct 28 2024 |
R-4.3-mac | OK | Oct 28 2024 |
Exports:DataSummDatumgetmodeshapiro_normality_test
Dependencies:clidplyrfansigenericsgluelifecyclemagrittrmomentsnortestpillarpkgconfigR6rlangtibbletidyselectutf8vctrswithr
Readme and manuals
Help Manual
Help page | Topics |
---|---|
Summarize an Entire Data Frame | DataSumm |
Summarize a Single Vector | Datum |
Get Mode of a Numeric Vector | getmode |
Perform Normality Test | shapiro_normality_test |