stri_stats_latex {stringi} | R Documentation |
This function gives LaTeX-oriented statistics for a character vector,
e.g. obtained by loading a text file with the
readLines
function, where each text line
is represented by a separate string.
stri_stats_latex(str)
str |
character vector to be aggregated |
We use a slightly modified LaTeX Word Count algorithm taken from Kile 2.1.3, see http://kile.sourceforge.net/team.php for original contributors.
Returns an integer vector with the following named elements:
CharsWord
- number of word characters;
CharsCmdEnvir
- command and words characters;
CharsWhite
- LaTeX white spaces, including { and } in some contexts;
Words
- number of words;
Cmds
- number of commands;
Envirs
- number of environments;
... (Other stuff that may appear in future releases of stringi).
Other stats: stri_stats_general
s <- c("Lorem \\textbf{ipsum} dolor sit \\textit{amet}, consectetur adipisicing elit.", "\\begin{small}Proin nibh augue,\\end{small} suscipit a, scelerisque sed, lacinia in, mi.", "") stri_stats_latex(s) ## Not run: # Stats for the preprint version of M. Gagolewski's book # "Programowanie w jezyku R", Wydawnictwo Naukowe PWN, 2014. # see http://rksiazka.rexamine.com apply( sapply( list.files(path="~/Publikacje/ProgramowanieR/rozdzialy/", pattern=glob2rx("*.tex"), recursive=TRUE, full.names=TRUE), function(x) stri_stats_latex(readLines(x)) ), 1, sum) CharsWord CharsCmdEnvir CharsWhite Words Cmds Envirs 718755 458403 281989 120202 37055 6119 ## End(Not run)