--- title: "Getting started with DataSum" output: rmarkdown::html_vignette vignette: > %\VignetteIndexEntry{Getting started with DataSum} %\VignetteEngine{knitr::rmarkdown} %\VignetteEncoding{UTF-8} --- ```{r, include = FALSE} knitr::opts_chunk$set(collapse = TRUE, comment = "#>") ``` # Why DataSum? DataSum is built for the first serious look at a dataset. Before modeling, teaching, or publication, analysts need to know what is missing, what is unusual, which variables are skewed, whether normality checks are meaningful, and which columns need closer inspection. ```{r setup} library(DataSum) ``` # Summarize one variable ```{r vector} summarize_vector(c(1, 2, 2, NA, 10), name = "score") ``` # Summarize a data frame ```{r data} summarize_data(iris) ``` Grouped summaries are useful for teaching and comparative research workflows. ```{r grouped} summarize_data(iris, by = "Species") ``` # Profile a dataset ```{r profile} profile <- profile_data(iris) profile$dataset profile$warnings ``` # Create a report scaffold ```{r report} report_path <- datasum_report(iris, format = "qmd", render = FALSE) file.exists(report_path) ``` The generated Quarto source contains the dataset overview, variable diagnostics, warnings, formula definitions, and interpretation notes. Rendering HTML, PDF, or DOCX output is available when the optional `quarto` package and Quarto CLI are installed.