.BG
.VE $Header: /usr3/s/current/s/.help/RCS/scandata,v 1.2 83/12/30 19:34:12 rab Exp $
.FN scandata
.TL
scandata: Scan fixed format data file
.CS
!S scandata <file >summary      #  UNIX command
.PP
.AG file
the name (unquoted) of a file containing fixed-format input data,
a sequence of records with fields in specific columns.
A maximum of 500 characters per record is permitted.
.AG summary
the name of the output file which will summarize the data.
For each column in `file'
a summary will be produced describing the number (and percentage)
of occurrences of specific characters in that column
(a 1-dimensional contingency table if the column contains categorical data).
.PP
Special characters (newlines, control characters, etc.) are represented
by the following notation in `summary':
`^A' is control-A, ... `^J' is control-J (newline), etc.
.PP
The summary also includes information on the number of records in the
file and the maximum record length.
.SH SEE ALSO
`extract'.
.EX
.ft 1
If the file "mydata" contains:

.ft CW
.ps -2
.vs -2
 123
 abc
 xyz
 aaa
 321
.ps +2
.vs +2
.ft 1

Then the command
`S scandata <junk'
produces:
.ft CW
.ps -2
.vs -2

 Char   Count     %    Column 1
 
 '1'        1   20.00
 '3'        1   20.00
 'a'        2   40.00
 'x'        1   20.00
 
 
 Char   Count     %    Column 2
 
 '2'        2   40.00
 'a'        1   20.00
 'b'        1   20.00
 'y'        1   20.00
 
 
 Char   Count     %    Column 3
 
 '1'        1   20.00
 '3'        1   20.00
 'a'        1   20.00
 'c'        1   20.00
 'z'        1   20.00
 
 
 Char   Count     %    Column 4
 
 '^J'       5  100.00
 
 
 Longest Record (including final newline) = 4
 Total Records  = 5
.ps +2
.vs +2
.ft 1
.KW data management
.KW utilities
.KW file
.WR
