Package: DataSum 0.1.0

DataSum: The DataSumm function takes a data frame as input and applies the Datum function to each column, returning a data frame with the summary statistics for each column

The moments package provides functions for calculating various statistical moments and related measures, such as skewness and kurtosis. The dplyr package is used for data manipulation, and the nortest package is used for normality testing. The find_mode function takes a data vector as input and returns the mode(s) of the data. The shapiro_normality_test function performs a Shapiro-Wilk normality test on the input data, and returns "Normal" if the data is normally distributed (p-value > 0.05), and "Not Normal" otherwise. If the data length is outside the valid range for the Shapiro-Wilk test (3 to 5000), it performs an Anderson-Darling normality test instead. The Datum function takes a data vector as input and returns a data frame with various summary statistics, including data type, sample size, mean, mode, median, variance, standard deviation, maximum, minimum, range, skewness, kurtosis, and normality test result. If the data is numeric, it calculates the statistics accordingly. If the data is character or factor, it provides the mode and marks the other statistics as not applicable (NA).The DataSumm function takes a data frame as input and applies the Datum function to each column, returning a data frame with the summary statistics for each column. Measures of Central Tendency Mean: The average of the values, calculated by summing all the values and dividing by the number of values. Median: The middle value when the data is arranged in order. If there are an even number of values, the median is the average of the two middle values. Mode: The value that appears most frequently in the data set. Measures of Dispersion Range: The difference between the largest and smallest values in the data set. Variance: A measure of how spread out the values are from the mean, calculated as the average squared deviation from the mean. Standard Deviation: The square root of the variance, providing a measure of the average amount each value deviates from the mean. Other Measures Skewness: A measure of the asymmetry of the probability distribution of a random variable around its mean. Positive skewness indicates a distribution with an asymmetric tail extending towards more positive values. Kurtosis: A measure of the "peakedness" of the probability distribution of a random variable. Normality: A test to determine if the data follows a normal (Gaussian) distribution, such as the Shapiro-Wilk test.

Authors:Mr. Uzair Javid Khan and Dr. Immad Ahmad Shah

DataSum_0.1.0.tar.gz
DataSum_0.1.0.zip(r-4.5)DataSum_0.1.0.zip(r-4.4)DataSum_0.1.0.zip(r-4.3)
DataSum_0.1.0.tgz(r-4.4-any)DataSum_0.1.0.tgz(r-4.3-any)
DataSum_0.1.0.tar.gz(r-4.5-noble)DataSum_0.1.0.tar.gz(r-4.4-noble)
DataSum_0.1.0.tgz(r-4.4-emscripten)DataSum_0.1.0.tgz(r-4.3-emscripten)
DataSum.pdf |DataSum.html
DataSum/json (API)

# Install 'DataSum' in R:
install.packages('DataSum', repos = c('https://uzairkhan11w.r-universe.dev', 'https://cloud.r-project.org'))

Peer review:

Bug tracker:https://github.com/uzairkhan11w/datasum/issues

On CRAN:

4 exports 0.71 score 18 dependencies 3 scripts

Last updated 2 months agofrom:153606be38. Checks:OK: 7. Indexed: yes.

TargetResultDate
Doc / VignettesOKAug 29 2024
R-4.5-winOKAug 29 2024
R-4.5-linuxOKAug 29 2024
R-4.4-winOKAug 29 2024
R-4.4-macOKAug 29 2024
R-4.3-winOKAug 29 2024
R-4.3-macOKAug 29 2024

Exports:DataSummDatumgetmodeshapiro_normality_test

Dependencies:clidplyrfansigenericsgluelifecyclemagrittrmomentsnortestpillarpkgconfigR6rlangtibbletidyselectutf8vctrswithr