An introduction to the C shell


                        William Joy

                 Computer Science Division
 Department of Electrical Engineering and Computer Science
             University of California, Berkeley
                 Berkeley, California 94720


                          _A_B_S_T_R_A_C_T

          _C_s_h is a new command language interpreter for
     UNIX|- systems.  It incorporates good  features  of
     other  shells  and  a _h_i_s_t_o_r_y mechanism similar to
     the _r_e_d_o of INTERLISP.  While  incorporating  many
     features  of other shells which make writing shell
     programs  (shell  scripts)  easier,  most  of  the
     features  unique  to _c_s_h are designed more for the
     interactive UNIX user.

          UNIX users who have read a general  introduc-
     tion  to  the  system  will  find a valuable basic
     explanation of the shell  here.   Simple  terminal
     interaction  with  _c_s_h  is  possible after reading
     just the first  section  of  this  document.   The
     second  section  describes the shells capabilities
     which you can explore  after  you  have  begun  to
     become  acquainted with the shell.  Later sections
     introduce  features  which  are  useful,  but  not
     necessary for all users of the shell.

          Back matter includes an appendix listing spe-
     cial  characters  of  the  shell and a glossary of
     terms and commands introduced in this manual.


May 27, 1987


_________________________
|- UNIX is a trademark of Bell Laboratories.


               An introduction to the C shell


                        William Joy

                 Computer Science Division
 Department of Electrical Engineering and Computer Science
             University of California, Berkeley
                 Berkeley, California 94720


_I_n_t_r_o_d_u_c_t_i_o_n

     A _s_h_e_l_l is a command language interpreter.  _C_s_h is  the
name  of  one  particular  command interpreter on UNIX.  The
primary purpose of _c_s_h is to translate command  lines  typed
at  a  terminal  into  system actions, such as invocation of
other programs.  _C_s_h is a user program  just  like  any  you
might  write.   Hopefully, _c_s_h will be a very useful program
for you in interacting with the UNIX system.

     In addition to this document, you will want to refer to
a  copy of the UNIX programmer's manual.  The _c_s_h documenta-
tion in the  manual  provides  a  full  description  of  all
features of the shell and is a final reference for questions
about the shell.

     Many words in  this  document  are  shown  in  _i_t_a_l_i_c_s.
These  are  important  words;  names  of commands, and words
which have special meaning in discussing the shell and UNIX.
Many  of  the  words are defined in a glossary at the end of
this document.  If you don't know what is meant by  a  word,
you should look for it in the glossary.

_A_c_k_n_o_w_l_e_d_g_e_m_e_n_t_s

     Numerous people have provided good input about previous
versions of _c_s_h and aided in its debugging and in the debug-
ging of its documentation.  I would especially like to thank
Michael  Ubell who made the crucial observation that history
commands could be done well over the word structure of input
text,  and  implemented  a prototype history mechanism in an
older version of the shell.  Eric Allman has also provided a
large  number  of  useful  comments on the shell, helping to
unify those concepts which are present and to  identify  and
eliminate  useless  and  marginally  useful  features.  Mike
O'Brien  suggested  the  pathname  hashing  mechanism  which
speeds  command  execution.   Jim Kulp added the job control
and directory stack primitives and added their documentation
to this introduction.


                           - 2 -


_1.  _T_e_r_m_i_n_a_l _u_s_a_g_e _o_f _t_h_e _s_h_e_l_l

_1._1.  _T_h_e _b_a_s_i_c _n_o_t_i_o_n _o_f _c_o_m_m_a_n_d_s

     A _s_h_e_l_l in UNIX acts mostly as a medium  through  which
other  _p_r_o_g_r_a_m_s  are invoked.  While it has a set of _b_u_i_l_t_i_n
functions which it performs directly,  most  commands  cause
execution  of  programs  that  are, in fact, external to the
shell.  The shell is thus  distinguished  from  the  command
interpreters  of  other  systems both by the fact that it is
just a user program, and by the fact that it is used  almost
exclusively as a mechanism for invoking other programs.

     _C_o_m_m_a_n_d_s in the  UNIX  system  consist  of  a  list  of
strings  or  _w_o_r_d_s interpreted as a _c_o_m_m_a_n_d _n_a_m_e followed by
_a_r_g_u_m_e_n_t_s.  Thus the command

        mail bill

consists of two words.  The first word _m_a_i_l names  the  com-
mand  to  be  executed,  in this case the mail program which
sends messages to other users.  The shell uses the  name  of
the  command  in  attempting to execute it for you.  It will
look in a number of _d_i_r_e_c_t_o_r_i_e_s for a  file  with  the  name
_m_a_i_l which is expected to contain the mail program.

     The rest of the words of the command are given as _a_r_g_u_-
_m_e_n_t_s  to  the  command itself when it is executed.  In this
case we specified also the argument  _b_i_l_l  which  is  inter-
preted  by the _m_a_i_l program to be the name of a user to whom
mail is to be sent.  In normal terminal usage we  might  use
the _m_a_i_l command as follows.

        % mail bill
        I have a question about the csh documentation.
        My document seems to be missing page 5.
        Does a page five exist?
                Bill
        EOT
        %


     Here we typed a message to send to _b_i_l_l and ended  this
message with a |^D which sent an end-of-file to the mail pro-
gram.  (Here and  throughout  this  document,  the  notation
``|^_x'' is to be read ``control-_x'' and represents the strik-
ing of the _x key while the control key is  held  down.)  The
mail  program then echoed the characters `EOT' and transmit-
ted our message.  The characters `% '  were  printed  before
and  after  the  mail  command by the shell to indicate that
input was needed.

     After typing the `% ' prompt the shell was reading com-
mand  input  from our terminal.  We typed a complete command


                           - 3 -


`mail bill'.  The shell then executed the _m_a_i_l program  with
argument  _b_i_l_l  and went dormant waiting for it to complete.
The mail program then read input from our terminal until  we
signalled  an  end-of-file  via  typing a |^D after which the
shell noticed that mail had completed and signaled  us  that
it  was  ready  to  read from the terminal again by printing
another `% ' prompt.

     This is the essential pattern of all  interaction  with
UNIX  through the shell.  A complete command is typed at the
terminal, the shell executes the command and when this  exe-
cution  completes, it prompts for a new command.  If you run
the editor for an hour, the shell will  patiently  wait  for
you  to finish editing and obediently prompt you again when-
ever you finish editing.

     An example of a useful command you can execute  now  is
the  _t_s_e_t  command,  which  sets  the default _e_r_a_s_e and _k_i_l_l
characters on your terminal - the erase character erases the
last  character  you typed and the kill character erases the
entire line you have entered so far.  By default, the  erase
character is `#' and the kill character is `@'.  Most people
who use CRT displays prefer to use the backspace (|^H)  char-
acter  as  their  erase character since it is then easier to
see what you have typed so far.  You can make this  be  true
by typing

        tset -e

which tells the program _t_s_e_t to set the erase character, and
its default setting for this character is a backspace.

_1._2.  _F_l_a_g _a_r_g_u_m_e_n_t_s

     A useful notion in UNIX is that  of  a  _f_l_a_g  argument.
While  many arguments to commands specify file names or user
names some arguments rather specify an  optional  capability
of  the  command  which  you wish to invoke.  By convention,
such arguments begin with the character `-' (hyphen).   Thus
the command

        ls

will produce a list of the  files  in  the  current  _w_o_r_k_i_n_g
_d_i_r_e_c_t_o_r_y.  The option -_s is the size option, and

        ls -s

causes _l_s to also give, for each file the size of  the  file
in  blocks  of  512 characters.  The manual section for each
command in the UNIX reference  manual  gives  the  available
options for each command.  The _l_s command has a large number
of useful and interesting options.  Most other commands have
either no options or only one or two options.  It is hard to


                           - 4 -


remember options of commands which are not  used  very  fre-
quently,  so  most  UNIX  utilities  perform only one or two
functions rather than having  a  large  number  of  hard  to
remember options.

_1._3.  _O_u_t_p_u_t _t_o _f_i_l_e_s

     Commands that normally read input or  write  output  on
the  terminal  can  also  be executed with this input and/or
output done to a file.

     Thus suppose we wish to save the current date in a file
called `now'.  The command

        date

will print the  current  date  on  our  terminal.   This  is
because  our terminal is the default _s_t_a_n_d_a_r_d _o_u_t_p_u_t for the
date command and the date command prints  the  date  on  its
standard  output.   The  shell lets us _r_e_d_i_r_e_c_t the _s_t_a_n_d_a_r_d
_o_u_t_p_u_t of a command through a notation using the _m_e_t_a_c_h_a_r_a_c_-
_t_e_r  `>'  and  the  name  of  the file where output is to be
placed.  Thus the command

        date > now

runs the _d_a_t_e command such that its standard output  is  the
file  `now'  rather  than  the  terminal.  Thus this command
places the current date and time into the file `now'.  It is
important to know that the _d_a_t_e command was unaware that its
output was going to a file rather than to the terminal.  The
shell  performed  this  _r_e_d_i_r_e_c_t_i_o_n before the command began
executing.

     One other thing to note here is  that  the  file  `now'
need  not have existed before the _d_a_t_e command was executed;
the shell would have created the file if it did  not  exist.
And  if  the  file  did exist?  If it had existed previously
these previous contents would have been discarded!  A  shell
option  _n_o_c_l_o_b_b_e_r  exists  to  prevent  this  from happening
accidentally; it is discussed in section 2.2.

     The system normally keeps files which you  create  with
`>'  and  all other files.  Thus the default is for files to
be permanent.  If you wish to create a file  which  will  be
removed  automatically,  you  can  begin its name with a `#'
character, this `scratch' character denotes  the  fact  that
the file will be a scratch file.*  The  system  will  remove
_________________________
*Note that if your erase character is a `#',  you  will
have  to precede the `#' with a `\'.  The fact that the
`#' character is the old (pre-CRT) standard erase char-
acter  means that it seldom appears in a file name, and
allows this convention to be used  for  scratch  files.


                           - 5 -


such files after a couple of days, or sooner if  file  space
becomes  very  tight.   Thus,  in  running  the _d_a_t_e command
above, we don't really want to save the output  forever,  so
we would more likely do

        date > #now


_1._4.  _M_e_t_a_c_h_a_r_a_c_t_e_r_s _i_n _t_h_e _s_h_e_l_l

     The shell has a  large  number  of  special  characters
(like  `>')  which  indicate special functions.  We say that
these notations have _s_y_n_t_a_c_t_i_c and _s_e_m_a_n_t_i_c meaning  to  the
shell.   In  general,  most  characters  which  are  neither
letters nor digits have special meaning to  the  shell.   We
shall  shortly learn a means of _q_u_o_t_a_t_i_o_n which allows us to
use _m_e_t_a_c_h_a_r_a_c_t_e_r_s without the shell treating  them  in  any
special way.

     Metacharacters normally have effect only when the shell
is reading our input.  We need not worry about placing shell
metacharacters in a letter we are sending via _m_a_i_l, or  when
we  are  typing in text or data to some other program.  Note
that the shell is only reading input when  it  has  prompted
with `% '.

_1._5.  _I_n_p_u_t _f_r_o_m _f_i_l_e_s; _p_i_p_e_l_i_n_e_s

     We learned above how to _r_e_d_i_r_e_c_t the _s_t_a_n_d_a_r_d _o_u_t_p_u_t of
a  command  to  a file.  It is also possible to redirect the
_s_t_a_n_d_a_r_d _i_n_p_u_t of a command from a file.  This is not  often
necessary  since  most  commands will read from a file whose
name is given as an argument.  We can give the command

        sort < data

to run the _s_o_r_t command with standard input, where the  com-
mand  normally  reads  its  input, from the file `data'.  We
would more likely say

        sort data

letting the _s_o_r_t command open  the  file  `data'  for  input
itself since this is less to type.

     We should note that if we just typed

        sort

_________________________
If  you are using a CRT, your erase character should be
a |^H, as we demonstrated in section 1.1 how this  could
be set up.


                           - 6 -


then the sort program would sort  lines  from  its  _s_t_a_n_d_a_r_d
_i_n_p_u_t.   Since  we  did  not _r_e_d_i_r_e_c_t the standard input, it
would sort lines as we typed them on the terminal  until  we
typed a |^D to indicate an end-of-file.

     A most useful capability is the ability to combine  the
standard  output  of  one command with the standard input of
another, i.e. to run the commands in a sequence known  as  a
_p_i_p_e_l_i_n_e.  For instance the command

        ls -s

normally produces a list of the files in our directory  with
the  size  of  each  in blocks of 512 characters.  If we are
interested in learning which of our files is largest we  may
wish  to have this sorted by size rather than by name, which
is the default way in which _l_s sorts.  We could look at  the
many  options of _l_s to see if there was an option to do this
but would eventually discover that there is not.  Instead we
can use a couple of simple options of the _s_o_r_t command, com-
bining it with _l_s to get what we want.

     The -_n option of sort specifies a numeric  sort  rather
than an alphabetic sort.  Thus

        ls -s | sort -n

specifies that the output of the _l_s  command  run  with  the
option  -_s  is  to be _p_i_p_e_d to the command _s_o_r_t run with the
numeric sort option.  This would give us a  sorted  list  of
our  files  by  size, but with the smallest first.  We could
then use the -_r reverse sort option and the _h_e_a_d command  in
combination with the previous command doing

        ls -s | sort -n -r | head -5

Here we have taken a list of  our  files  sorted  alphabeti-
cally,  each  with  the size in blocks.  We have run this to
the standard input of the _s_o_r_t command  asking  it  to  sort
numerically  in  reverse order (largest first).  This output
has then been run into the command _h_e_a_d which gives  us  the
first  few  lines.   In this case we have asked _h_e_a_d for the
first 5 lines.  Thus this command gives  us  the  names  and
sizes of our 5 largest files.

     The  notation  introduced  above  is  called  the  _p_i_p_e
mechanism.   Commands  separated  by `|' characters are con-
nected together by the shell and the standard output of each
is  run  into  the standard input of the next.  The leftmost
command in a pipeline will normally take its standard  input
from  the terminal and the rightmost will place its standard
output on the terminal.  Other examples of pipelines will be
given  later  when  we  discuss  the  history mechanism; one
important use of pipes which is illustrated there is in  the


                           - 7 -


routing of information to the line printer.

_1._6.  _F_i_l_e_n_a_m_e_s

     Many commands to be executed will  need  the  names  of
files  as  arguments.  UNIX _p_a_t_h_n_a_m_e_s consist of a number of
_c_o_m_p_o_n_e_n_t_s separated by `/'.  Each component except the last
names  a  directory  in which the next component resides, in
effect specifying the _p_a_t_h of directories to follow to reach
the file.  Thus the pathname

        /etc/motd

specifies a file in the directory `etc' which is a subdirec-
tory  of  the _r_o_o_t directory `/'.  Within this directory the
file named is `motd' which stands for `message of the  day'.
A  _p_a_t_h_n_a_m_e  that begins with a slash is said to be an _a_b_s_o_-
_l_u_t_e pathname since it is specified from the absolute top of
the  entire  directory  hierarchy  of the system (the _r_o_o_t).
_P_a_t_h_n_a_m_e_s which do not begin with  `/'  are  interpreted  as
starting  in  the  current  _w_o_r_k_i_n_g  _d_i_r_e_c_t_o_r_y, which is, by
default, your _h_o_m_e directory and can be changed  dynamically
by the _c_d change directory command.  Such pathnames are said
to be _r_e_l_a_t_i_v_e to the working directory since they are found
by starting in the working directory and descending to lower
levels of directories for each _c_o_m_p_o_n_e_n_t  of  the  pathname.
If  the pathname contains no slashes at all then the file is
contained in the working directory itself and  the  pathname
is  merely the name of the file in this directory.  Absolute
pathnames have no relation to the working directory.

     Most filenames consist  of  a  number  of  alphanumeric
characters  and `.'s (periods).  In fact, all printing char-
acters except `/' (slash) may appear in  filenames.   It  is
inconvenient  to  have  most  non-alphabetic  characters  in
filenames because many of these have special meaning to  the
shell.    The   character  `.'  (period)  is  not  a  shell-
metacharacter and is often used to separate the _e_x_t_e_n_s_i_o_n of
a file name from the base of the name.  Thus

        prog.c prog.o prog.errs prog.output

are four related files.  They share a _b_a_s_e portion of a name
(a  base  portion  being  that part of the name that is left
when a trailing `.' and following characters which  are  not
`.'  are  stripped  off).   The  file  `prog.c' might be the
source for a C program, the file `prog.o' the  corresponding
object  file, the file `prog.errs' the errors resulting from
a compilation of the program and the file `prog.output'  the
output of a run of the program.

     If we wished to refer to all four of these files  in  a
command, we could use the notation


                           - 8 -


        prog.*

This word is expanded by the shell, before  the  command  to
which  it  is  an argument is executed, into a list of names
which begin with `prog.'.  The character  `*'  here  matches
any sequence (including the empty sequence) of characters in
a file name.   The  names  which  match  are  alphabetically
sorted and placed in the _a_r_g_u_m_e_n_t _l_i_s_t of the command.  Thus
the command

        echo prog.*

will echo the names

        prog.c prog.errs prog.o prog.output

Note that the names are in sorted order  here,  and  a  dif-
ferent  order  than  we listed them above.  The _e_c_h_o command
receives four words as arguments, even though we only  typed
one  word as as argument directly.  The four words were gen-
erated by _f_i_l_e_n_a_m_e _e_x_p_a_n_s_i_o_n of the one input word.

     Other notations for _f_i_l_e_n_a_m_e _e_x_p_a_n_s_i_o_n are also  avail-
able.   The  character `?' matches any single character in a
filename.  Thus

        echo ? ?? ???

will echo a line of filenames; first those with one  charac-
ter  names, then those with two character names, and finally
those with three character names.  The names of each  length
will be independently sorted.

     Another mechanism consists of a sequence of  characters
between  `['  and `]'.  This metasequence matches any single
character from the enclosed set.  Thus

        prog.[co]

will match

        prog.c prog.o

in the example above.  We  can  also  place  two  characters
around a `-' in this notation to denote a range.  Thus

        chap.[1-5]

might match files

        chap.1 chap.2 chap.3 chap.4 chap.5

if they existed.  This is shorthand for


                           - 9 -


        chap.[12345]

and otherwise equivalent.

     An important point to note is that if a list  of  argu-
ment words to a command (an _a_r_g_u_m_e_n_t _l_i_s_t) contains filename
expansion syntax, and  if  this  filename  expansion  syntax
fails  to match any existing file names, then the shell con-
siders this to be an error and prints a diagnostic

        No match.

and does not execute the command.

     Another very important point is  that  files  with  the
character  `.' at the beginning are treated specially.  Nei-
ther `*' or `?' or the `['  `]'  mechanism  will  match  it.
This  prevents  accidental matching of the filenames `.' and
`..' in the working directory which have special meaning  to
the  system, as well as other files such as ._c_s_h_r_c which are
not normally visible.  We will discuss the special  role  of
the file ._c_s_h_r_c later.

     Another filename expansion mechanism  gives  access  to
the  pathname  of  the  _h_o_m_e directory of other users.  This
notation consists of the character `~' (tilde)  followed  by
another  users'  login  name.  For instance the word `~bill'
would map to the pathname `/usr/bill' if the home  directory
for  `bill' was `/usr/bill'.  Since, on large systems, users
may have login directories  scattered  over  many  different
disk  volumes  with  different  prefix directory names, this
notation provides a reliable way of accessing the  files  of
other users.

     A special case of  this  notation  consists  of  a  `~'
alone,  e.g.  `~/mbox'.   This  notation  is expanded by the
shell into the file `mbox' in your _h_o_m_e directory, i.e. into
`/usr/bill/mbox'  for  me  on Ernie Co-vax, the UCB Computer
Science Department VAX  machine,  where  this  document  was
prepared.   This  can  be very useful if you have used _c_d to
change to another directory and have found a file  you  wish
to copy using _c_p.  If I give the command

        cp thatfile ~

the shell will expand this command to

        cp thatfile /usr/bill

since my home directory is /usr/bill.

     There also exists a mechanism using the characters  `{'
and  `}'  for  abbreviating a set of words which have common


                           - 10 -


parts but cannot be  abbreviated  by  the  above  mechanisms
because  they are not files, are the names of files which do
not yet exist, are not thus  conveniently  described.   This
mechanism  will  be described much later, in section 4.2, as
it is used less frequently.

_1._7.  _Q_u_o_t_a_t_i_o_n

     We have already seen a number of metacharacters used by
the  shell.   These metacharacters pose a problem in that we
cannot use them directly as parts of words.  Thus  the  com-
mand

        echo *

will not echo the character `*'.  It  will  either  echo  an
sorted  list  of filenames in the current _w_o_r_k_i_n_g _d_i_r_e_c_t_o_r_y,
or print the message `No match' if there are no files in the
working directory.

     The recommended mechanism for placing characters  which
are  neither numbers, digits, `/', `.' or `-' in an argument
word to a command is to enclose  it  with  single  quotation
characters `'', i.e.

        echo '*'

There is one special character `!' which is used by the _h_i_s_-
_t_o_r_y  mechanism  of the shell and which cannot be _e_s_c_a_p_e_d by
placing it within `'' characters.  It and the character  `''
itself can be preceded by a single `\' to prevent their spe-
cial meaning.  Thus

        echo \'\!

prints

        '!

These two mechanisms suffice to place any printing character
into  a  word which is an argument to a shell command.  They
can be combined, as in

        echo \''*'

which prints

        '*

since the first `\' escaped the first `'' and  the  `*'  was
enclosed between `'' characters.


                           - 11 -


_1._8.  _T_e_r_m_i_n_a_t_i_n_g _c_o_m_m_a_n_d_s

     When you are executing a command and the shell is wait-
ing for it to complete there are several ways to force it to
stop.  For instance if you type the command

        cat /etc/passwd

the system will print a copy of a list of all users  of  the
system  on  your  terminal.   This is likely to continue for
several minutes unless you stop it.  You can send an  INTER-
RUPT  _s_i_g_n_a_l  to the _c_a_t command by typing the DEL or RUBOUT
key on your terminal.* Since _c_a_t does not take  any  precau-
tions to avoid or otherwise handle this signal the INTERRUPT
will cause it to terminate.  The shell notices that _c_a_t  has
terminated  and  prompts  you  again  with `% '.  If you hit
INTERRUPT again, the shell will just repeat its prompt since
it handles INTERRUPT signals and chooses to continue to exe-
cute commands rather than terminating like  _c_a_t  did,  which
would have the effect of logging you out.

     Another way in which many programs  terminate  is  when
they get an end-of-file from their standard input.  Thus the
_m_a_i_l program in the first example above was terminated  when
we  typed a |^D which generates an end-of-file from the stan-
dard input.  The shell also terminates when it gets an  end-
of-file  printing  `logout'; UNIX then logs you off the sys-
tem.  Since  this  means  that  typing  too  many  |^D's  can
accidentally  log  us  off,  the  shell  has a mechanism for
preventing this.  This _i_g_n_o_r_e_e_o_f option will be discussed in
section 2.2.

     If a command has its standard input redirected  from  a
file,  then  it  will normally terminate when it reaches the
end of this file.  Thus if we execute

        mail bill < prepared.text

the mail command will terminate without  our  typing  a  |^D.
This  is  because  it  read  to  the end-of-file of our file
`prepared.text' in which we placed a message for `bill' with
an editor program.  We could also have done

        cat prepared.text | mail bill

since the _c_a_t command  would  then  have  written  the  text
through  the pipe to the standard input of the mail command.
When the _c_a_t command completed  it  would  have  terminated,
closing  down  the  pipeline and the _m_a_i_l command would have
received an end-of-file from it  and  terminated.   Using  a
_________________________
*Many users use _s_t_t_y(1) to change the interrupt charac-
ter to |^C.


                           - 12 -


pipe here is more complicated than redirecting input  so  we
would  more likely use the first form.  These commands could
also have been stopped by sending an INTERRUPT.

     Another  possibility  for  stopping  a  command  is  to
suspend  its  execution temporarily, with the possibility of
continuing execution later.  This is done by sending a  STOP
signal  via  typing  a  |^Z.  This signal causes all commands
running on the terminal (usually one but more if a  pipeline
is  executing)  to become suspended.  The shell notices that
the command(s) have been suspended, types `Stopped' and then
prompts for a new command.  The previously executing command
has been suspended, but otherwise  unaffected  by  the  STOP
signal.  Any other commands can be executed while the origi-
nal command remains suspended.  The suspended command can be
continued using the _f_g command with no arguments.  The shell
will then retype the command to remind you which command  is
being  continued, and cause the command to resume execution.
Unless any input files in use by the suspended command  have
been  changed  in the meantime, the suspension has no effect
whatsoever on the execution of the  command.   This  feature
can  be very useful during editing, when you need to look at
another  file  before  continuing.  An  example  of  command
suspension follows.

        % mail harold
        Someone just copied a big file into my directory and its name is
        |^Z
        Stopped
        % ls
        funnyfile
        prog.c
        prog.o
        % jobs
        [1]  + Stopped   mail harold
        % fg
        mail harold
        funnyfile. Do you know who did it?
        EOT
        %

In this example someone was sending a message to Harold  and
forgot  the name of the file he wanted to mention.  The mail
command was suspended by typing |^Z.  When the shell  noticed
that  the mail program was suspended, it typed `Stopped' and
prompted for a new command.  Then the _l_s command  was  typed
to  find out the name of the file.  The _j_o_b_s command was run
to find out which command was suspended. At this time the _f_g
command was typed to continue execution of the mail program.
Input to the mail program was then continued and ended  with
a  |^D  which  indicated the end of the message at which time
the mail program typed EOT.   The  _j_o_b_s  command  will  show
which  commands  are suspended.  The |^Z should only be typed
at the beginning of a line since  everything  typed  on  the


                           - 13 -


current  line  is  discarded  when a signal is sent from the
keyboard.  This also happens on INTERRUPT, and QUIT signals.
More  information on suspending jobs and controlling them is
given in section 2.6.

     If you write  or  run  programs  which  are  not  fully
debugged  then  it  may  be  necessary to stop them somewhat
ungracefully.  This can be done by sending them a QUIT  sig-
nal,  sent  by  typing  a |^\.  This will usually provoke the
shell to produce a message like:

        Quit (Core dumped)

indicating that a file `core' has  been  created  containing
information  about  the  program `a.out's state when it ter-
minated due to the QUIT signal.  You can examine  this  file
yourself,  or  forward  information to the maintainer of the
program telling him/her where the _c_o_r_e _f_i_l_e is.

     If you run background commands (as explained in section
2.6) then these commands will ignore INTERRUPT and QUIT sig-
nals at the terminal.  To stop them you must  use  the  _k_i_l_l
command.  See section 2.6 for an example.

     If you want to examine the output of a command  without
having it move off the screen as the output of the

        cat /etc/passwd

command will, you can use the command

        more /etc/passwd

The _m_o_r_e program pauses after each  complete  screenful  and
types  `--More--'  at which point you can hit a space to get
another screenful, a return to get another line, or a `q' to
end  the  _m_o_r_e  program.  You can also use more as a filter,
i.e.

        cat /etc/passwd | more

works just like the more simple more command above.

     For stopping output of commands not involving _m_o_r_e  you
can  use  the  |^S key to stop the typeout.  The typeout will
resume when you hit |^Q or any other key, but |^Q is  normally
used because it only restarts the output and does not become
input to the program which is running.  This works  well  on
low-speed  terminals, but at 9600 baud it is hard to type |^S
and |^Q fast enough to paginate the output nicely, and a pro-
gram like _m_o_r_e is usually used.

     An additional possibility is to use the |^O flush output
character; when this character is typed, all output from the


                           - 14 -


current command is thrown  away  (quickly)  until  the  next
input  read occurs or until the next shell prompt.  This can
be used to allow a command to  complete  without  having  to
suffer  through  the output on a slow terminal; |^O is a tog-
gle, so flushing can be turned off by typing |^O again  while
output is being flushed.

_1._9.  _W_h_a_t _n_o_w?

     We have so far seen a number of mechanisms of the shell
and  learned  a lot about the way in which it operates.  The
remaining sections will go yet further into the internals of
the  shell,  but you will surely want to try using the shell
before you go any further.  To try it you can log in to UNIX
and type the following command to the system:

        chsh myname /bin/csh

Here `myname' should be replaced by the name  you  typed  to
the  system prompt of `login:' to get onto the system.  Thus
I would use `chsh bill /bin/csh'.  _Y_o_u _o_n_l_y _h_a_v_e _t_o _d_o  _t_h_i_s
_o_n_c_e;  _i_t  _t_a_k_e_s _e_f_f_e_c_t _a_t _n_e_x_t _l_o_g_i_n.  You are now ready to
try using _c_s_h.

     Before you do the `chsh' command,  the  shell  you  are
using  when  you log into the system is `/bin/sh'.  In fact,
much of the above discussion  is  applicable  to  `/bin/sh'.
The  next section will introduce many features particular to
_c_s_h so you should change your shell to _c_s_h before you  begin
reading it.


                           - 15 -


_2.  _D_e_t_a_i_l_s _o_n _t_h_e _s_h_e_l_l _f_o_r _t_e_r_m_i_n_a_l _u_s_e_r_s

_2._1.  _S_h_e_l_l _s_t_a_r_t_u_p _a_n_d _t_e_r_m_i_n_a_t_i_o_n

     When you login, the shell is started by the  system  in
your  _h_o_m_e  directory  and begins by reading commands from a
file ._c_s_h_r_c in this directory.  All  shells  which  you  may
start during your terminal session will read from this file.
We will later see what kinds of commands are usefully placed
there.   For  now  we  need not have this file and the shell
does not complain about its absence.

     A _l_o_g_i_n _s_h_e_l_l, executed after you login to the  system,
will,  after  it  reads  commands from ._c_s_h_r_c, read commands
from a file ._l_o_g_i_n also in your home directory.   This  file
contains  commands  which you wish to do each time you login
to the UNIX system.  My ._l_o_g_i_n file looks something like:

        set ignoreeof
        set mail=(/usr/spool/mail/bill)
        echo "${prompt}users" ; users
        alias ts \
                'set noglob ; eval `tset -s -m dialup:c100rv4pna -m plugboard:?hp2621nl *`';
        ts; stty intr |^C kill |^U crt
        set time=15 history=10
        msgs -f
        if (-e $mail) then
                echo "${prompt}mail"
                mail
        endif


     This file contains several commands to be  executed  by
UNIX each time I login.  The first is a _s_e_t command which is
interpreted directly by the shell.  It sets the shell  vari-
able _i_g_n_o_r_e_e_o_f which causes the shell to not log me off if I
hit |^D.  Rather, I use the _l_o_g_o_u_t command to log off of  the
system.   By  setting  the _m_a_i_l variable, I ask the shell to
watch for incoming mail to me.  Every 5  minutes  the  shell
looks  for  this  file and tells me if more mail has arrived
there.  An alternative to this is to put the command

        biff y

in place of this _s_e_t; this will  cause  me  to  be  notified
immediately when mail arrives, and to be shown the first few
lines of the new message.

     Next I set the shell variable `time'  to  `15'  causing
the  shell  to  automatically print out statistics lines for
commands which execute for at least 15 seconds of CPU  time.
The  variable  `history' is set to 10 indicating that I want
the shell to remember the last 10 commands  I  type  in  its
_h_i_s_t_o_r_y _l_i_s_t, (described later).


                           - 16 -


     I create an _a_l_i_a_s ``ts'' which executes a _t_s_e_t(1)  com-
mand  setting  up the modes of the terminal.  The parameters
to _t_s_e_t indicate the kinds of terminal which I  usually  use
when  not  on  a  hardwired port.  I then execute ``ts'' and
also use the _s_t_t_y command to change the interrupt  character
to |^C and the line kill character to |^U.

     I then run the `msgs' program, which provides  me  with
any  system  messages which I have not seen before; the `-f'
option here prevents it from telling me  anything  if  there
are  no  new  messages.  Finally, if my mailbox file exists,
then I run the `mail' program to process my mail.

     When the `mail' and `msgs' programs finish,  the  shell
will finish processing my ._l_o_g_i_n file and begin reading com-
mands from the terminal, prompting for each with `% '.  When
I  log  off  (by  giving  the _l_o_g_o_u_t command) the shell will
print `logout' and execute commands from the file  `.logout'
if  it  exists  in  my home directory.  After that the shell
will terminate and UNIX will log me off the system.  If  the
system  is  not  going down, I will receive a new login mes-
sage.  In any case, after the `logout' message the shell  is
committed to terminating and will take no further input from
my terminal.

_2._2.  _S_h_e_l_l _v_a_r_i_a_b_l_e_s

     The shell maintains a set of _v_a_r_i_a_b_l_e_s.  We  saw  above
the  variables  _h_i_s_t_o_r_y  and  _t_i_m_e which had values `10' and
`15'.  In fact, each shell variable has as value an array of
zero  or  more  _s_t_r_i_n_g_s.   Shell  variables  may be assigned
values by the set command.  It has several forms,  the  most
useful of which was given above and is

        set name=value


     Shell variables may be used to store values  which  are
to  be used in commands later through a substitution mechan-
ism.  The shell variables most commonly referenced are, how-
ever,  those  which the shell itself refers to.  By changing
the values of these variables one can  directly  affect  the
behavior of the shell.

     One of the most important  variables  is  the  variable
_p_a_t_h.   This variable contains a sequence of directory names
where the shell searches for commands.  The _s_e_t command with
no  arguments  shows  the  value  of all variables currently
defined (we usually say _s_e_t)  in  the  shell.   The  default
value for path will be shown by _s_e_t to be


                           - 17 -


        % set
        argv   ()
        cwd    /usr/bill
        home   /usr/bill
        path   (. /usr/ucb /bin /usr/bin)
        prompt %
        shell  /bin/csh
        status 0
        term   c100rv4pna
        user   bill
        %

This output indicates that the variable path points  to  the
current  directory  `.'  and  then  `/usr/ucb',  `/bin'  and
`/usr/bin'.  Commands which you may write might  be  in  `.'
(usually  one  of  your directories).  Commands developed at
Berkeley, live in `/usr/ucb'  while  commands  developed  at
Bell Laboratories live in `/bin' and `/usr/bin'.

     A number of locally developed programs  on  the  system
live  in  the  directory  `/usr/local'.  If we wish that all
shells which we invoke to have access to these new  programs
we can place the command

        set path=(. /usr/ucb /bin /usr/bin /usr/local)

in our file ._c_s_h_r_c in our home directory.   Try  doing  this
and then logging out and back in and do

        set

again to see that the value assigned to _p_a_t_h has changed.

     One thing you should be aware  of  is  that  the  shell
examines  each directory which you insert into your path and
determines which commands are contained there.   Except  for
the current directory `.', which the shell treats specially,
this means that if commands are added to a directory in your
search  path after you have started the shell, they will not
necessarily be found by the shell.  If you  wish  to  use  a
command  which  has  been added in this way, you should give
the command

        rehash

to the shell, which will cause it to recompute its  internal
table  of  command locations, so that it will find the newly
added command.  Since the shell has to look in  the  current
directory  `.' on each command, placing it at the end of the
path specification usually works  equivalently  and  reduces
overhead.

     Other useful built in variables are the  variable  _h_o_m_e


                           - 18 -


which  shows  your  home  directory, _c_w_d which contains your
current working directory, the variable _i_g_n_o_r_e_e_o_f which  can
be  set  in  your  ._l_o_g_i_n file to tell the shell not to exit
when  it  receives  an  end-of-file  from  a  terminal   (as
described  above).   The  variable  `ignoreeof'  is  one  of
several variables which the shell does not  care  about  the
value  of,  only whether they are _s_e_t or _u_n_s_e_t.  Thus to set
this variable you simply do

        set ignoreeof

and to unset it do

        unset ignoreeof

These give the variable `ignoreeof' no value,  but  none  is
desired or required.

     Finally, some other built-in shell variables of use are
the variables _n_o_c_l_o_b_b_e_r and _m_a_i_l.  The metasyntax

        > filename

which redirects  the  standard  output  of  a  command  will
overwrite  and  destroy  the  previous contents of the named
file.  In this way you may  accidentally  overwrite  a  file
which  is  valuable.  If you would prefer that the shell not
overwrite files in this way you can

        set noclobber

in your ._l_o_g_i_n file.  Then trying to do

        date > now

would cause a diagnostic  if  `now'  existed  already.   You
could type

        date >!  now

if you really wanted to overwrite  the  contents  of  `now'.
The  `>!' is a special metasyntax indicating that clobbering
the file is ok.|-

_2._3.  _T_h_e _s_h_e_l_l'_s _h_i_s_t_o_r_y _l_i_s_t

     The shell can maintain a _h_i_s_t_o_r_y  _l_i_s_t  into  which  it
places  the  words  of previous commands.  It is possible to
use a notation to reuse commands or words from  commands  in
_________________________
|-The space between the `!' and the word `now' is criti-
cal  here, as `!now' would be an invocation of the _h_i_s-
_t_o_r_y mechanism, and have a totally different effect.


                           - 19 -


forming new commands.  This mechanism can be used to  repeat
previous  commands  or  to  correct minor typing mistakes in
commands.

     The following figure gives a sample  session  involving
typical usage of the history mechanism  of  the  shell.   In
this example we have a very simple C program which has a bug
(or two) in it in the file `bug.c', which we  `cat'  out  on
our  terminal.   We  then  try  to run the C compiler on it,
referring to the file again as `!$', meaning the last  argu-
ment  to  the previous command.  Here the `!' is the history
mechanism invocation metacharacter, and the `$'  stands  for
the  last  argument,  by  analogy to `$' in the editor which
stands for the end of the line.  The shell echoed  the  com-
mand, as it would have been typed without use of the history
mechanism, and then executed it.   The  compilation  yielded
error  diagnostics  so  we now run the editor on the file we
were trying to compile, fix the bug, and run the C  compiler
again,  this  time referring to this command simply as `!c',
which repeats the last command which started with the letter
`c'.   If  there  were other commands starting with `c' done
recently we could have said  `!cc'  or  even  `!cc:p'  which
would  have  printed  the  last  command  starting with `cc'
without executing it.

     After this recompilation, we ran the resulting  `a.out'
file,  and  then  noting that there still was a bug, ran the
editor again.  After fixing the program we ran  the  C  com-
piler  again,  but tacked onto the command an extra `-o bug'
telling the compiler to place the resultant  binary  in  the
file  `bug'  rather  than  `a.out'.  In general, the history
mechanisms may be used anywhere in the formation of new com-
mands  and  other  characters may be placed before and after
the substituted commands.

     We then ran the `size' command to  see  how  large  the
binary  program images we have created were, and then an `ls
-l' command with the same argument list, denoting the  argu-
ment list `*'.  Finally we ran the program `bug' to see that
its output is indeed correct.

     To make a numbered listing of the program  we  ran  the
`num' command on the file `bug.c'.  In order to compress out
blank lines in the output of `num' we ran the output through
the filter `ssp', but misspelled it as spp.  To correct this
we used a shell substitute, placing the  old  text  and  new
text between `|^' characters.  This is similar to the substi-
tute command in the editor.  Finally, we repeated  the  same
command with `!!', but sent its output to the line printer.

     There are other mechanisms available for repeating com-
mands.   The _h_i_s_t_o_r_y command prints out a number of previous
commands with numbers  by  which  they  can  be  referenced.
There  is  a way to refer to a previous command by searching


                           - 20 -


        % cat bug.c
        main()

        {
                printf("hello);
        }
        % cc !$
        cc bug.c
        "bug.c", line 4: newline in string or char constant
        "bug.c", line 5: syntax error
        % ed !$
        ed bug.c
        29
        4s/);/"&/p
                printf("hello");
        w
        30
        q
        % !c
        cc bug.c
        % a.out
        hello% !e
        ed bug.c
        30
        4s/lo/lo\\n/p
                printf("hello\n");
        w
        32
        q
        % !c -o bug
        cc bug.c -o bug
        % size a.out bug
        a.out: 2784+364+1028 = 4176b = 0x1050b
        bug: 2784+364+1028 = 4176b = 0x1050b
        % ls -l !*
        ls -l a.out bug
        -rwxr-xr-x 1 bill       3932 Dec 19 09:41 a.out
        -rwxr-xr-x 1 bill       3932 Dec 19 09:42 bug
        % bug
        hello
        % num bug.c | spp
        spp: Command not found.
        % |^spp|^ssp
        num bug.c | ssp
            1   main()
            3   {
            4           printf("hello\n");
            5   }
        % !! | lpr
        num bug.c | ssp | lpr
        %


                           - 21 -


for a string which appeared in it, and there are other, less
useful,  ways  to  select arguments to include in a new com-
mand.  A complete description of  all  these  mechanisms  is
given  in  the  C shell manual pages in the UNIX Programmers
Manual.

_2._4.  _A_l_i_a_s_e_s

     The shell has an _a_l_i_a_s mechanism which can be  used  to
make  transformations on input commands.  This mechanism can
be used to simplify the commands you type, to supply default
arguments to commands, or to perform transformations on com-
mands and their arguments.  The alias facility is similar to
a macro facility.  Some of the features obtained by aliasing
can be obtained also using shell command  files,  but  these
take  place  in  another  instance  of  the shell and cannot
directly affect the current shells  environment  or  involve
commands such as _c_d which must be done in the current shell.

     As an example, suppose that there is a new  version  of
the  mail program on the system called `newmail' you wish to
use, rather than the standard mail program which  is  called
`mail'.  If you place the shell command

        alias mail newmail

in your ._c_s_h_r_c file, the shell will transform an input  line
of the form

        mail bill

into a call on `newmail'.  More generally, suppose  we  wish
the  command  `ls' to always show sizes of files, that is to
always do `-s'.  We can do

        alias ls ls -s

or even

        alias dir ls -s

creating a new command syntax `dir' which does an  `ls  -s'.
If we say

        dir ~bill

then the shell will translate this to

        ls -s /mnt/bill


     Thus the _a_l_i_a_s mechanism can be used to  provide  short
names  for  commands,  to  provide default arguments, and to
define new short commands in terms of other commands.  It is


                           - 22 -


also  possible to define aliases which contain multiple com-
mands or pipelines, showing where the arguments to the  ori-
ginal  command are to be substituted using the facilities of
the history mechanism.  Thus the definition

        alias cd 'cd \!* ; ls '

would do an _l_s command after each change directory  _c_d  com-
mand.   We enclosed the entire alias definition in `'' char-
acters to prevent most substitutions from occurring and  the
character `;' from being recognized as a metacharacter.  The
`!' here is escaped with a `\'  to  prevent  it  from  being
interpreted  when  the alias command is typed in.  The `\!*'
here substitutes  the  entire  argument  list  to  the  pre-
aliasing  _c_d  command, without giving an error if there were
no arguments.  The `;' separating commands is used  here  to
indicate  that  one command is to be done and then the next.
Similarly the definition

        alias whois 'grep \!|^ /etc/passwd'

defines a command which looks up its first argument  in  the
password file.

     _W_a_r_n_i_n_g: The shell currently reads the ._c_s_h_r_c file each
time  it starts up.  If you place a large number of commands
there, shells will tend to start slowly.   A  mechanism  for
saving  the  shell environment after reading the ._c_s_h_r_c file
and quickly restoring it is under development, but  for  now
you  should try to limit the number of aliases you have to a
reasonable number... 10 or 15 is reasonable, 50 or  60  will
cause a noticeable delay in starting up shells, and make the
system seem sluggish when you execute commands  from  within
the editor and other programs.

_2._5.  _M_o_r_e _r_e_d_i_r_e_c_t_i_o_n; >> _a_n_d >&

     There are a few more notations useful to  the  terminal
user which have not been introduced yet.

     In addition to the standard output, commands also  have
a _d_i_a_g_n_o_s_t_i_c _o_u_t_p_u_t which is normally directed to the termi-
nal even when the standard output is redirected to a file or
a pipe.  It is occasionally desirable to direct the diagnos-
tic output along with the standard output.  For instance  if
you  want  to  redirect the output of a long running command
into a file and wish to have a record of any error  diagnos-
tic it produces you can do

        command >& file

The `>&' here tells the shell to route both  the  diagnostic
output  and  the standard output into `file'.  Similarly you
can give the command


                           - 23 -


        command |& lpr

to route both standard and  diagnostic  output  through  the
pipe to the line printer daemon _l_p_r.#

     Finally, it is possible to use the form

        command >> file

to place output at the end of an existing file.|-

_2._6.  _J_o_b_s; _B_a_c_k_g_r_o_u_n_d, _F_o_r_e_g_r_o_u_n_d, _o_r _S_u_s_p_e_n_d_e_d

     When one or more commands are typed together as a pipe-
line or as a sequence of commands separated by semicolons, a
single _j_o_b is created by the shell consisting of these  com-
mands  together as a unit.  Single commands without pipes or
semicolons create the simplest jobs.   Usually,  every  line
typed  to  the  shell creates a job.  Some lines that create
jobs (one per line) are

        sort < data
        ls -s | sort -n | head -5
        mail harold


     If the metacharacter `&' is typed at  the  end  of  the
commands, then the job is started as a _b_a_c_k_g_r_o_u_n_d job.  This
means that the shell does not wait for it  to  complete  but
immediately  prompts  and is ready for another command.  The
job runs _i_n _t_h_e _b_a_c_k_g_r_o_u_n_d at  the  same  time  that  normal
jobs,  called  _f_o_r_e_g_r_o_u_n_d jobs, continue to be read and exe-
cuted by the shell one at a time.  Thus

        du > usage &

would run the _d_u program, which reports on the disk usage of
_________________________
#A command form

     command >&! file

exists, and is used when _n_o_c_l_o_b_b_e_r is set and _f_i_l_e  al-
ready exists.
|-If _n_o_c_l_o_b_b_e_r is set, then an error will result if _f_i_l_e
does not exist, otherwise the shell will create _f_i_l_e if
it doesn't exist.  A form

     command >>! file

makes it not be an error for file to not exist when _n_o-
_c_l_o_b_b_e_r is set.


                           - 24 -


your working directory (as well  as  any  directories  below
it), put the output into the file `usage' and return immedi-
ately with a prompt for the next command without out waiting
for  _d_u  to finish.  The _d_u program would continue executing
in the background until it finished,  even  though  you  can
type  and  execute  more  commands in the mean time.  When a
background job terminates, a message is typed by  the  shell
just  before  the  next  prompt telling you that the job has
completed.  In the following example  the  _d_u  job  finishes
sometime  during  the  execution of the _m_a_i_l command and its
completion is reported just before the prompt after the _m_a_i_l
job is finished.

        % du > usage &
        [1] 503
        % mail bill
        How do you know when a background job is finished?
        EOT
        [1] - Done       du > usage
        %

If the job did not terminate  normally  the  `Done'  message
might  say  something  else  like `Killed'.  If you want the
terminations of background jobs to be reported at  the  time
they  occur (possibly interrupting the output of other fore-
ground jobs), you can set the _n_o_t_i_f_y variable.  In the  pre-
vious  example this would mean that the `Done' message might
have come right in the middle of the message to Bill.  Back-
ground  jobs are unaffected by any signals from the keyboard
like the STOP, INTERRUPT, or QUIT signals mentioned earlier.

     Jobs are recorded in a table  inside  the  shell  until
they terminate.  In this table, the shell remembers the com-
mand names, arguments and the _p_r_o_c_e_s_s _n_u_m_b_e_r_s  of  all  com-
mands  in the job as well as the working directory where the
job was started.  Each job in the table is either running _i_n
_t_h_e  _f_o_r_e_g_r_o_u_n_d  with the shell waiting for it to terminate,
running _i_n _t_h_e _b_a_c_k_g_r_o_u_n_d, or _s_u_s_p_e_n_d_e_d.  Only one  job  can
be  running  in the foreground at one time, but several jobs
can be suspended or running in the background at  once.   As
each  job  is  started,  it  is assigned a small identifying
number called the _j_o_b _n_u_m_b_e_r which  can  be  used  later  to
refer  to  the  job  in  the  commands described below.  Job
numbers remain the same until the job  terminates  and  then
are re-used.

     When a job is started in the backgound using  `&',  its
number,  as  well  as  the  process  numbers of all its (top
level) commands, is typed by the shell before prompting  you
for another command. For example,

        % ls -s | sort -n > usage &
        [2] 2034 2035
        %


                           - 25 -


runs the `ls' program with the `-s' options, pipes this out-
put  into the `sort' program with the `-n' option which puts
its output into the file `usage'.  Since the `&' was at  the
end of the line, these two programs were started together as
a background job.  After starting the job, the shell  prints
the  job number in brackets (2 in this case) followed by the
process number of each program started in the job.  Then the
shell  immediates prompts for a new command, leaving the job
running simultaneously.

     As mentioned in section  1.8,  foreground  jobs  become
_s_u_s_p_e_n_d_e_d  by  typing  |^Z  which  sends a STOP signal to the
currently running foreground  job.   A  background  job  can
become  suspended by using the _s_t_o_p command described below.
When jobs are suspended they merely stop  any  further  pro-
gress  until  started again, either in the foreground or the
backgound.  The shell notices when a job becomes stopped and
reports  this  fact, much like it reports the termination of
background jobs.  For foreground jobs this looks like

        % du > usage
        |^Z
        Stopped
        %

`Stopped' message is typed by the shell when it notices that
the _d_u program stopped.  For background jobs, using the _s_t_o_p
command, it is

        % sort usage &
        [1] 2345
        % stop %1