A Tour through the UNIX|- C Compiler


                       D. M. Ritchie

                     Bell Laboratories
               Murray Hill, New Jersey 07974


_T_h_e _I_n_t_e_r_m_e_d_i_a_t_e _L_a_n_g_u_a_g_e

Communication between the two phases of the compiler  proper
is  carried  out  by  means of a pair of intermediate files.
These files  are  treated  as  having  identical  structure,
although  the  second  file contains only the code generated
for  strings.   It  is  convenient  to  write  strings   out
separately to reduce the need for multiple location counters
in a later assembly phase.

     The intermediate language is  not  machine-independent;
its  structure  in a number of ways reflects the fact that C
was originally a one-pass compiler chopped in two to  reduce
the  maximum  memory  requirement.  In fact, only the latest
version of the compiler has a complete intermediate language
at  all.   Until  recently,  the first phase of the compiler
generated assembly code for  those  constructions  it  could
deal  with,  and  passed expression parse trees, in absolute
binary form, to the second phase for code generation.   Now,
at  least,  all  inter-phase  information  is  passed  in  a
describable  form,  and  there  are  no  absolute   pointers
involved,  so  the  coupling  between  the  phases is not so
strong.

     The areas in which the machine (and  system)  dependen-
cies are most noticeable are

1.   Storage allocation for automatic  variables  and  argu-
     ments  has  already  been performed, and nodes for such
     variables refer  to  them  by  offset  from  a  display
     pointer.  Type conversion (for example, from integer to
     pointer) has already occurred using the  assumption  of
     byte addressing and 2-byte words.

2.   Data  representations  suitable  to  the   PDP-11   are
__________________________
|-UNIX is a Trademark of Bell Laboratories.


                     September 28, 1987


                           - 2 -


     assumed; in particular, floating  point  constants  are
     passed as four words in the machine representation.

     As it happens, each intermediate file is represented as
a  sequence  of binary numbers without any explicit demarca-
tions.  It consists of a sequence of conceptual lines,  each
headed  by  an  operator,  and  possibly  containing various
operands.  The operators are small  numbers;  to  assist  in
recognizing  failure in synchronization, the high-order byte
of each operator  word  is  always  the  octal  number  376.
Operands  are  either  16-bit  binary  numbers or strings of
characters representing names.  Each name is terminated by a
null  character.   There  is  no  alignment  requirement for
numerical operands and so there is no padding after  a  name
string.

     The binary  representation  was  chosen  to  avoid  the
necessity  of  converting  to and from character form and to
minimize the size of the files.  It would be  very  easy  to
make  each operator-operand `line' in the file be a genuine,
printable line, with the numbers in octal or  decimal;  this
in fact was the representation originally used.

     The operators fall naturally into  two  classes:  those
which  represent  part  of  an  expression,  and all others.
Expressions are transmitted in a reverse-Polish notation; as
they  are being read, a tree is built which is isomorphic to
the tree constructed in the first  phase.   Expressions  are
passed  as  a whole, with no non-expression operators inter-
vening.  The reader maintains a  stack;  each  leaf  of  the
expression  tree  (name,  constant)  is pushed on the stack;
each unary operator replaces the top of the stack by a  node
whose  operand is the old top-of-stack; each binary operator
replaces the top pair on the  stack  with  a  single  entry.
When the expression is complete there is exactly one item on
the stack.  Following each expression is a special  operator
which  passes  the unique previous expression to the `optim-
izer' described below and then to the code generator.

     Here is the list of operators not  themselves  part  of
expressions.


_E_O_F

     marks the end of an input file.

_B_D_A_T_A _f_l_a_g _d_a_t_a ...

     specifies a sequence of bytes to be assembled as static
     data.   It  is  followed  by  pairs of words; the first
     member of the pair is non-zero  to  indicate  that  the
     data  continue; a zero flag is not followed by data and
     terminates the operator.  The  data  bytes  occupy  the


                     September 28, 1987


                           - 3 -


     low-order part of a word.

_W_D_A_T_A _f_l_a_g _d_a_t_a ...

     specifies a sequence of words to be assembled as static
     data; it is identical to the BDATA operator except that
     entire words, not just bytes, are passed.

_P_R_O_G

     means that subsequent information is to be compiled  as
     program text.

_D_A_T_A

     means that subsequent information is to be compiled  as
     static data.

_B_S_S

     means that subsequent information is to be compiled  as
     unitialized static data.

_S_Y_M_D_E_F _n_a_m_e

     means that the symbol _n_a_m_e is an external name  defined
     in the current program.  It is produced for each exter-
     nal data or function definition.

_C_S_P_A_C_E _n_a_m_e _s_i_z_e

     indicates that the name refers to  a  data  area  whose
     size  is the specified number of bytes.  It is produced
     for external data definitions without explicit initial-
     ization.

_S_S_P_A_C_E _s_i_z_e

     indicates that _s_i_z_e bytes should be set aside for  data
     storage.   It  is used to pad out short initializations
     of external  data  and  to  reserve  space  for  static
     (internal) data.  It will be preceded by an appropriate
     label.

_E_V_E_N

     is produced after each external data  definition  whose
     size  is  not  an  integral number of words.  It is not
     produced after strings except when  they  initialize  a
     character array.


                     September 28, 1987


                           - 4 -


_N_L_A_B_E_L _n_a_m_e

     is produced just before a BDATA or  WDATA  initializing
     external data, and serves as a label for the data.

_R_L_A_B_E_L _n_a_m_e

     is produced just before each function  definition,  and
     labels its entry point.

_S_N_A_M_E _n_a_m_e _n_u_m_b_e_r

     is produced at the start  of  each  function  for  each
     static  variable or label declared therein.  Subsequent
     uses of the variable will be  in  terms  of  the  given
     number.  The code generator uses this only to produce a
     debugging symbol table.

_A_N_A_M_E _n_a_m_e _n_u_m_b_e_r

     Likewise, each  automatic  variable's  name  and  stack
     offset  is specified by this operator.  Arguments count
     as automatics.

_R_N_A_M_E _n_a_m_e _n_u_m_b_e_r

     Each register variable is  similarly  named,  with  its
     register number.

_S_A_V_E _n_u_m_b_e_r

     produces a register-save sequence at the start of  each
     function, just after its label (RLABEL).

_S_E_T_R_E_G _n_u_m_b_e_r

     is used to indicate the number of  registers  used  for
     register  variables.   It  actually  gives the register
     number of the lowest free  register;  it  is  redundant
     because the RNAME operators could be counted instead.

_P_R_O_F_I_L

     is produced before the save sequence for functions when
     the  profile  option is turned on.  It produces code to
     count the number of times the function is called.

_S_W_I_T _d_e_f_l_a_b _l_i_n_e _l_a_b_e_l _v_a_l_u_e ...

     is produced for switches.  When control flows into  it,
     the  value  being switched on is in the register forced
     by RFORCE (below).  The switch  statement  occurred  on
     the  indicated line of the source, and the label number
     of the default location is _d_e_f_l_a_b.  Then  the  operator


                     September 28, 1987


                           - 5 -


     is  followed  by  a  sequence of label-number and value
     pairs; the list is terminated by a 0 label.

_L_A_B_E_L _n_u_m_b_e_r

     generates an internal label.  It is referred  to  else-
     where using the given number.

_B_R_A_N_C_H _n_u_m_b_e_r

     indicates an unconditional  transfer  to  the  internal
     label number given.

_R_E_T_R_N

     produces the return sequence for a function.  It occurs
     only once, at the end of each function.

_E_X_P_R _l_i_n_e

     causes the expression just preceding  to  be  compiled.
     The argument is the line number in the source where the
     expression occurred.

_N_A_M_E _c_l_a_s_s _t_y_p_e _n_a_m_e


_N_A_M_E _c_l_a_s_s _t_y_p_e _n_u_m_b_e_r

     indicates a name occurring in an expression.  The first
     form is used when the name is external; the second when
     the name is automatic, static, or a register.  Then the
     number indicates the stack offset, the label number, or
     the register number as  appropriate.   Class  and  type
     encoding is described elsewhere.

_C_O_N _t_y_p_e _v_a_l_u_e

     transmits an integer constant.  This and the  next  two
     operators occur as part of expressions.

_F_C_O_N _t_y_p_e _4-_w_o_r_d-_v_a_l_u_e

     transmits a floating constant as four words  in  PDP-11
     notation.

_S_F_C_O_N _t_y_p_e _v_a_l_u_e

     transmits a  floating-point  constant  whose  value  is
     correctly  represented by its high-order word in PDP-11
     notation.


                     September 28, 1987


                           - 6 -


_N_U_L_L

     indicates a null argument list of a function call in an
     expression;  call  is  a  binary  operator whose second
     operand is the argument list.

_C_B_R_A_N_C_H _l_a_b_e_l _c_o_n_d

     produces a conditional branch.   It  is  an  expression
     operator,  and will be followed by an EXPR.  The branch
     to the label number takes  place  if  the  expression's
     truth  value  is the same as that of _c_o_n_d.  That is, if
     _c_o_n_d=_1 and the expression evaluates to true, the branch
     is taken.

_b_i_n_a_r_y-_o_p_e_r_a_t_o_r _t_y_p_e

     There are binary operators corresponding to  each  such
     source-language  operator;  the  type  of the result of
     each is passed as well.  Some  perhaps-unexpected  ones
     are:  COMMA,  which  is  a  right-associative  operator
     designed to simplify right-to-left evaluation of  func-
     tion  arguments;  prefix  and  postfix ++ and --, whose
     second operand is the increment amount, as a CON; QUEST
     and  COLON,  to  express  the conditional expression as
     `a?(b:c)'; and a  sequence  of  special  operators  for
     expressing  relations between pointers, in case pointer
     comparison is different from integer  comparison  (e.g.
     unsigned).

_u_n_a_r_y-_o_p_e_r_a_t_o_r _t_y_p_e

     There are also numerous unary operators.  These include
     ITOF,  FTOI, FTOL, LTOF, ITOL, LTOI which convert among
     floating,  long,  and  integer;  JUMP  which   branches
     indirectly through a label expression; INIT, which com-
     piles the value of a constant  expression  used  as  an
     initializer;  RFORCE,  which  is  used  before a return
     sequence or a switch to place a value in an agreed-upon
     register.

_E_x_p_r_e_s_s_i_o_n _O_p_t_i_m_i_z_a_t_i_o_n

     Each expression tree, as it is read in, is subjected to
a  fairly  comprehensive analysis.  This is performed by the
_o_p_t_i_m routine and a number of subroutines; the major  things
done are

1.   Modifications and simplifications of the  tree  so  its
     value may be computed more efficiently and conveniently
     by the code generator.

2.   Marking each interior node  with  an  estimate  of  the
     number  of  registers  required  to  evaluate it.  This


                     September 28, 1987


                           - 7 -


     register count is needed to guide the  code  generation
     algorithm.

     One thing that is definitely not done is  discovery  or
exploitation of common subexpressions, nor is this done any-
where in the compiler.

     The basic organization is simple: a depth-first scan of
the  tree.   _O_p_t_i_m  does  nothing for leaf nodes (except for
automatics; see below), and calls _u_n_o_p_t_i_m  to  handle  unary
operators.  For binary operators, it calls itself to process
the operands, then treats  each  operator  separately.   One
important  case  is  commutative  and associative operators,
which are handled by _a_c_o_m_m_u_t_e.

     Here is a brief catalog of the transformations  carried
out  by by _o_p_t_i_m itself.  It is not intended to be complete.
Some of the transformations are machine-dependent,  although
they may well be useful on machines other than the PDP-11.

1.   As indicated in the discussion of  _u_n_o_p_t_i_m  below,  the
     optimizer  can  create a node type corresponding to the
     location  addressed  by  a  register  plus  a  constant
     offset.   Since this is precisely the implementation of
     automatic variables and arguments, where  the  register
     is  fixed  by convention, such variables are changed to
     the new form to simplify later processing.

2.   Associative and commutative operators are processed  by
     the special routine _a_c_o_m_m_u_t_e.

3.   After processing by _a_c_o_m_m_u_t_e, the bitwise & operator is
     turned  into  a  new  _a_n_d_n operator; `a & b' becomes `a
     _a_n_d_n ~b'.  This is done because the PDP-11 provides  no
     _a_n_d  operator, but only _a_n_d_n.  A similar transformation
     takes place for `=&'.

4.   Relationals are turned around so the  more  complicated
     expression is on the left.  (So that `2 > f(x)' becomes
     `f(x) < 2').  This improves code generation  since  the
     algorithm  prefers  to  have  the right operand require
     fewer registers than the left.

5.   An expression minus  a  constant  is  turned  into  the
     expression plus the negative constant, and the _a_c_o_m_m_u_t_e
     routine is called to take advantage of  the  properties
     of addition.

6.   Operators with constant operands are evaluated.

7.   Right shifts (unless by 1) are turned into left  shifts
     with  a negated right operand, since the PDP-11 lacks a
     general right-shift operator.


                     September 28, 1987


                           - 8 -


8.   A number of special cases are simplified, such as divi-
     sion or multiplication by 1, and shifts by 0.

The _u_n_o_p_t_i_m routine performs the same sort of processing for
unary operators.

1.   `*&x' and `&*x' are simplified to `x'.

2.   If _r is a register and _c is a constant or  the  address
     of  a  static  or  external  variable,  the expressions
     `*(r+c)' and `*r' are turned into  a  special  kind  of
     name  node  which  expresses  the  name  itself and the
     offset.  This simplifies subsequent processing  because
     such  constructions  can appear as the the address of a
     PDP-11 instruction.

3.   When the unary `&' operator is applied to a  name  node
     of  the  special kind just discussed, it is reworked to
     make the addition explicit again; this is done  because
     the PDP-11 has no `load address' instruction.

4.   Constructions like `*r++'  and  `*--r'  where  _r  is  a
     register  are discovered and marked as being implement-
     able using the  PDP-11  auto-increment  and  -decrement
     modes.

5.   If `!' is applied to a relational, the `!' is discarded
     and the sense of the relational is reversed.

6.   Special cases involving reflexive use of  negation  and
     complementation are discovered.

7.   Operations applying to constants are evaluated.

     The _a_c_o_m_m_u_t_e routine, called for associative and commu-
tative operators, discovers clusters of the same operator at
the top levels of the current tree, and arranges them  in  a
list:  for `a+((b+c)+(d+f))' the list would be`a,b,c,d,e,f'.
After each subtree is  optimized,  the  list  is  sorted  in
decreasing  difficulty  of  computation; as mentioned above,
the code generation algorithm works best when left  operands
are the difficult ones.  The `degree of difficulty' computed
is  actually  finer  than  the  mere  number  of   registers
required;  a constant is considered simpler than the address
of a static or external, which is simpler than reference  to
a  variable.   This  makes it easy to fold all the constants
together, and also to merge together the sum of  a  constant
and the address of a static or external (since in such nodes
there is space for an `offset' value).  There are also  spe-
cial cases, like multiplication by 1 and addition of 0.

A special routine is invoked to  handle  sums  of  products.
_D_i_s_t_r_i_b  is  based  on the fact that it is better to compute
`c1*c2*x  +  c1*y'  as  `c1*(c2*x  +  y)'  and   makes   the


                     September 28, 1987


                           - 9 -


divisibility tests required to assure the correctness of the
transformation.  This transformation is rarely possible with
code  directly written by the user, but it invariably occurs
as a  result  of  the  implementation  of  multi-dimensional
arrays.

     Finally, _a_c_o_m_m_u_t_e reconstructs a tree from the list  of
expressions which result.

_C_o_d_e _G_e_n_e_r_a_t_i_o_n

     The grand plan for code-generation  is  independent  of
any  particular  machine;  it  depends  largely  on a set of
tables.  But this fact does not  necessarily  make  it  very
easy  to  modify  the  compiler  to  produce  code for other
machines, both because there is  a  good  deal  of  machine-
dependent  structure in the tables, and because in any event
such tables are non-trivial to prepare.

     The arguments to  the  basic  code  generation  routine
_r_c_e_x_p_r  are  a pointer to a tree representing an expression,
the name of a code-generation table, and  the  number  of  a
register  in  which  the  value  of the expression should be
placed.  _R_c_e_x_p_r returns the number of the register in  which
the  value actually ended up; its caller may need to produce
a _m_o_v instruction if the value really needs  to  be  in  the
given register.  There are four code generation tables.

     _R_e_g_t_a_b is the basic one, which actually  does  the  job
described above: namely, compile code which places the value
represented by the expression tree in a register.

     _C_c_t_a_b is used when the value of the expression  is  not
actually  needed,  but  instead  the  value of the condition
codes resulting from evaluation  of  the  expression.   This
table is used, for example, to evaluate the expression after
_i_f.  It is clearly silly to calculate the value (0 or 1)  of
the expression `a==b' in the context `if (a==b) ... '

     The _s_p_t_a_b table is used when the value of an expression
is  to  be  pushed  on  the stack, for example when it is an
actual argument.  For example in the function call `f(a)' it
is a bad idea to load _a into a register which is then pushed
on the stack, when there is a single instruction which  does
the job.

     The _e_f_f_t_a_b table is used when an expression  is  to  be
evaluated  for its side effects, not its value.  This occurs
mostly for expressions which are statements, which  have  no
value.  Thus the code for the statement `a = b' need produce
only the approoriate _m_o_v instruction, and need not leave the
value  of _b in a register, while in the expression `a + (b =
c)' the value of `b = c' will appear in a register.


                     September 28, 1987


                           - 10 -


     All of the tables besides _r_e_g_t_a_b are rather small,  and
handle only a relatively few special cases.  If one of these
subsidiary tables does not contain an  entry  applicable  to
the  given  expression  tree,  _r_c_e_x_p_r uses _r_e_g_t_a_b to put the
value of the expression  into  a  register  and  then  fixes
things  up;  nothing need be done when the table was _e_f_f_t_a_b,
but a _t_s_t instruction is produced when the table called  for
was  _c_c_t_a_b,  and  a _m_o_v instruction, pushing the register on
the stack, when the table was _s_p_t_a_b.

     The _r_c_e_x_p_r routine itself picks off some special cases,
then  calls  _c_e_x_p_r to do the real work.  _C_e_x_p_r tries to find
an entry applicable to the given tree in  the  given  table,
and returns -1 if no such entry is found, letting _r_c_e_x_p_r try
again with a different table.  A successful match  yields  a
string  containing both literal characters which are written
out and pseudo-operations, or macros,  which  are  expanded.
Before  studying  the contents of these strings we will con-
sider how table entries are matched against trees.

     Recall that most non-leaf nodes in an  expression  tree
contain  the  name  of  the  operator, the type of the value
represented, and pointers to the subtrees (operands).   They
also contain an estimate of the number of registers required
to evaluate the expression, placed there by the  expression-
optimizer  routines.   The register counts are used to guide
the code generation process, which is based  on  the  Sethi-
Ullman algorithm.

     The main code generation tables consist of entries each
containing  an  operator  number and a pointer to a subtable
for the corresponding operator.  A subtable  consists  of  a
sequence of entries, each with a key describing certain pro-
perties of the operands of the operator involved; associated
with   the   key  is  a  code  string.   Once  the  subtable
corresponding to the operator  is  found,  the  subtable  is
searched linearly until a key is found such that the proper-
ties demanded by the key are compatible with the operands of
the  tree node.  A successful match returns the code string;
an unsuccessful search, either for the operator in the  main
table  or a compatble key in the subtable, returns a failure
indication.

     The tables are all contained in a file  which  must  be
processed to obtain an assembly language program.  Thus they
are written in  a  special-purpose  language.   To  provided
definiteness to the following discussion, here is an example
of a subtable entry.

        %n,aw
                F
                add     A2,R

The `%' indicates the key; the information following (up  to


                     September 28, 1987


                           - 11 -


a blank line) specifies the code string.  Very briefly, this
entry is in the subtable for `+' of _r_e_g_t_a_b; the  key  speci-
fies  that  the  left  operand is any integer, character, or
pointer expression, and the right operand is any word  quan-
tity  which is directly addressible (e.g. a variable or con-
stant).  The code string calls for  the  generation  of  the
code  to  compile  the left (first) operand into the current
register (`F') and then  to  produce  an  `add'  instruction
which  adds the second operand (`A2') to the register (`R').
All of the notation will be explained below.

     Only three features of the operands are used in  decid-
ing whether a match has occurred.  They are:

1.   Is  the  type  of  the  operand  compatible  with  that
     demanded?

2.   Is the `degree of difficulty'  (in  a  sense  described
     below) compatible?

3.   The table may  demand  that  the  operand  have  a  `*'
     (indirection operator) as its highest operator.

     As suggested above, the key for  a  subtable  entry  is
indicated by a `%,' and a comma-separated pair of specifica-
tions  for  the  operands.   (The  second  specification  is
ignored  for  unary operators).  A specification indicates a
type requirement by including one of the following  letters.
If  no  type  letter  is present, any integer, character, or
pointer operand will satisfy  the  requirement  (not  float,
double, or long).

b    A byte (character) operand is required.

w    A word (integer or pointer) operand is required.

f    A float or double operand is required.

d    A double operand is required.

l    A long (32-bit integer) operand is required.

     Before discussing the `degree of difficulty' specifica-
tion,  the  algorithm  has  to be explained more completely.
_R_c_e_x_p_r (and _c_e_x_p_r) are called  with  a  register  number  in
which  to  place their result.  Registers 0, 1, ... are used
during evaluation of expressions; the maximum register which
can  be  used  in this way depends on the number of register
variables, but in any event only registers 0 through  4  are
available  since  r5  is used as a stack frame header and r6
(sp) and r7 (pc) have special hardware properties.  The code
generation  routines assume that when called with register _n
as argument, they may use _n+_1, ...  (up to the first  regis-
ter  variable)  as  temporaries.   Consider  the  expression


                     September 28, 1987


                           - 12 -


`X+Y', where both X and  Y  are  expressions.   As  a  first
approximation, there are three ways of compiling code to put
this expression in register _n.

1.   If Y is an addressible cell, (recursively) put  X  into
     register _n and add Y to it.

2.   If Y is an expression  that  can  be  calculated  in  _k
     registers, where _k smaller than the number of registers
     available, compile X into register _n, Y  into  register
     _n+_1, and add register _n+_1 to _n.

3.   Otherwise, compile Y into register _n, save  the  result
     in  a temporary (actually, on the stack) compile X into
     register _n, then add in the temporary.

     The distinction between cases 2 and 3 therefore depends
on whether the right operand can be compiled in fewer than _k
registers, where _k is the  number  of  free  registers  left
after  registers  0  through  _n are taken: 0 through _n-_1 are
presumed to contain already computed  temporary  results;  _n
will, in case 2, contain the value of the left operand while
the right is being evaluated.

     These considerations should make clear  the  specifica-
tion  codes  for  the  degree of difficulty, bearing in mind
that a number of special cases are also present:

z    is satisfied when the operand is zero, so that  special
     code can be produced for expressions like `x = 0'.

1    is satisfied when the operand is  the  constant  1,  to
     optimize  cases  like  left and right shift by 1, which
     can be done efficiently on the PDP-11.

c    is satisfied when the operand is  a  positive  (16-bit)
     constant; this takes care of some special cases in long
     arithmetic.

a    is satisfied when  the  operand  is  addressible;  this
     occurs  not  only for variables and constants, but also
     for  some  more  complicated  constructions,  such   as
     indirection  through  a simple variable, `*p++' where _p
     is a register variable (because of the  PDP-11's  auto-
     increment  address  mode),  and  `*(p+c)'  where _p is a
     register and _c is a constant.  Precisely, the  require-
     ment is that the operand refers to a cell whose address
     can be written as a source or destination of  a  PDP-11
     instruction.

e    is satisfied by an operand whose value can be generated
     in  a  register using no more than _k registers, where _k
     is the number  of  registers  left  (not  counting  the
     current register).  The `e' stands for `easy.'


                     September 28, 1987


                           - 13 -


n    is satisfied by any operand.  The `n' stands for  `any-
     thing.'

     These degrees of difficulty are considered to lie in  a
linear  ordering and any operand which satisfies an earlier-
mentioned requirement will satisfy a later one.   Since  the
subtables  are  searched linearly, if a `1' specification is
included, almost certainly a `z' must be  written  first  to
prevent expressions containing the constant 0 to be compiled
as if the 0 were 1.

     Finally, a key specification may contain  a  `*'  which
requires  the  operand to have an indirection as its leading
operator.  Examples below should clarify the utility of this
specification.

     Now let us consider the contents  of  the  code  string
associated   with   each  subtable  entry.   Conventionally,
lower-case letters in this string represent literal informa-
tion  which  is  copied  directly to the output.  Upper-case
letters generally introduce specific macro-operations,  some
of which may be followed by modifying information.  The code
strings in the tables are written with  tabs  and  new-lines
used freely to suggest instructions which will be generated;
the table-compiling program compresses tabs (using the  0200
bit  of the next character) and throws away some of the new-
lines.  For example the macro `F' is ordinarily written on a
line  by  itself;  but  since  its expansion will end with a
new-line, the new-line  after  `F'  itself  is  dispensable.
This is all to reduce the size of the stored tables.

     The first set of  macro-operations  is  concerned  with
compiling  subtrees.   Recall that this is done by the _c_e_x_p_r
routine.  In the following discussion the `current register'
is  generally  the  argument register to _c_e_x_p_r; that is, the
place where the result is desired.  The `next  register'  is
numbered one higher than the current register.  (This expla-
nation isn't fully true because of complications,  described
below,  involving operations which require even-odd register
pairs.)

F    causes a recursive call to the _r_c_e_x_p_r routine  to  com-
     pile  code  which  places the value of the first (left)
     operand of the operator in the current register.

F1   generates code which places  the  value  of  the  first
     operand  in  the next register.  It is incorrectly used
     if there might be no next register;  that  is,  if  the
     degree  of  difficulty  of  the  first  operand  is not
     `easy;' if not, another register might  not  be  avail-
     able.

FS   generates code which pushes  the  value  of  the  first
     operand  on  the  stack,  by  calling _r_c_e_x_p_r specifying


                     September 28, 1987


                           - 14 -


     _s_p_t_a_b as the table.

Analogously,

S, S1, SScompile the second (right) operand into the current
     register, the next register, or onto the stack.

To deal with registers, there are

R    which expands into the name of the current register.

R1   which expands into the name of the next register.

R+   which expands into the the name of the current register
     plus  1.   It was suggested above that this is the same
     as the next register, except for complications; here is
     one  of  them.  Long integer variables have 32 bits and
     require 2 registers; in such cases the next register is
     the  current  register  plus 2.  The code would like to
     talk about both halves  of  the  long  quantity,  so  R
     refers  to the register with the high-order part and R+
     to the low-order part.

R-   This is another complication,  involving  division  and
     mod.   These  operations involve a pair of registers of
     which  the  odd-numbered  contains  the  left  operand.
     _C_e_x_p_r arranges that the current register is odd; the R-
     notation allows the code to refer to  the  next  lower,
     even-numbered register.

To refer to addressible quantities, there are the notations:

A1   causes generation of the address specified by the first
     operand.   For  this  to  be legal, the operand must be
     addressible; its key must contain an `a' or a more res-
     trictive specification.

A2   correspondingly generates the  address  of  the  second
     operand providing it has one.

     We now have enough mechanism to  show  a  complete,  if
suboptimal,  table  for  the  +  operator  on  word  or byte
operands.


                     September 28, 1987


                           - 15 -


        %n,z
                F

        %n,1
                F
                inc     R

        %n,aw
                F
                add     A2,R

        %n,e
                F
                S1
                add     R1,R

        %n,n
                SS
                F
                add     (sp)+,R

The first two sequences handle some special cases.  Actually
it  turns out that handling a right operand of 0 is unneces-
sary since the expression-optimizer throws out  adds  of  0.
Adding  1 by using the `increment' instruction is done next,
and then the case where the right  operand  is  addressible.
It  must  be a word quantity, since the PDP-11 lacks an `add
byte'  instruction.   Finally  the  cases  where  the  right
operand  either  can,  or  cannot,  be done in the available
registers are treated.

     The next macro-instructions are conveniently introduced
by noticing that the above table is suitable for subtraction
as well as addition, since no use is made  of  the  commuta-
tivity  of  addition.  All that is needed is substitution of
`sub' for `add' and `dec' for 'inc.' Considerable saving  of
space  is  achieved  by factoring out several similar opera-
tions.

I    is replaced by a string from another table  indexed  by
     the  operator  in the node being expanded.  This secon-
     dary table actually contains two strings per operator.

I'   is replaced by the second  string  in  the  side  table
     entry for the current operator.

     Thus, given that the entries for `+'  and  `-'  in  the
side  table  (which  is  called _i_n_s_t_a_b) are `add' and `inc,'
`sub' and `dec' respectively, the middle  of  of  the  above
addition table can be written


                     September 28, 1987


                           - 16 -


        %n,1
                F
                I'      R

        %n,aw
                F
                I       A2,R

and it will be suitable for subtraction, and  several  other
operators, as well.

     Next, there is the question of character and  floating-
point operations.

B1   generates the letter `b' if  the  first  operand  is  a
     character,  `f'  if  it is float or double, and nothing
     otherwise.  It is used in a context like `movB1'  which
     generates   a  `mov',  `movb',  or  `movf'  instruction
     according to the type of the operand.

B2   is just like B1 but applies to the second operand.

BE   generates `b' if either operand is a character and null
     otherwise.

BF   generates `f' if the type of the operator  node  itself
     is float or double, otherwise null.

     For example, there is an entry in _e_f_f_t_a_b  for  the  `='
operator

        %a,aw
        %ab,a
                IBE     A2,A1

Note first that two key specifications can be applied to the
same  code  string.   Next,  observe  that  when  a  word is
assigned to a byte or to a word, or a word is assigned to  a
byte,  a  single  instruction, a _m_o_v or _m_o_v_b as appropriate,
does the job.  However, when a byte is assigned to  a  word,
it  must  pass  through  a  register  to implement the sign-
extension rules:

        %a,n
                S
                IB1     R,A1


     Next, there is the  question  of  handling  indirection
properly.   Consider  the expression `X + *Y', where X and Y
are expressions, Assuming that Y is  more  complicated  than
just  a  variable, but on the other hand qualifies as `easy'
in the context, the expression would be compiled by  placing


                     September 28, 1987


                           - 17 -


the  value of X in a register, that of *Y in the next regis-
ter, and adding the registers.  It is easy  to  see  that  a
better job can be done by compiling X, then Y (into the next
register), and producing the instruction symbolized by  `add
(R1),R'.  This scheme avoids generating the instruction `mov
(R1),R1' required actually to place the value  of  *Y  in  a
register.  A related situation occurs with the expression `X
+ *(p+6)', which  exemplifies  a  construction  frequent  in
structure  and  array  references.  The addition table shown
above would produce

        [put X in register R]
        mov     p,R1
        add     $6,R1
        mov     (R1),R1
        add     R1,R

when the best code is

        [put X in R]
        mov     p,R1
        add     6(R1),R

As we said above, a key specification for a code table entry
may require an operand to have an indirection as its highest
operator.  To make use of  the  requirement,  the  following
macros are provided.

F*   the first operand must have the form *X.  If in partic-
     ular  it  has  the  form *(Y + c), for some constant _c,
     then code is produced which places the value  of  Y  in
     the  current  register.   Otherwise,  code  is produced
     which loads X into the current register.

F1*  resembles F* except that the next register is loaded.

S*   resembles F* except that the second operand is loaded.

S1*  resembles S* except that the next register is loaded.

FS*  The first operand must have the form  `*X'.   Push  the
     value of X on the stack.

SS*  resembles FS* except that  it  applies  to  the  second
     operand.

To capture the constant that may have been skipped  over  in
the above macros, there are

#1   The first operand must have the form *X; if in particu-
     lar it has the form *(Y + c) for _c a constant, then the
     constant is written out, otherwise a null string.

#2   is the same as #1 except that  the  second  operand  is


                     September 28, 1987


                           - 18 -


     used.

Now we can improve the addition table  above.   Just  before
the `%n,e' entry, put

        %n,ew*
                F
                S1*
                add     #2(R1),R

and just before the `%n,n' put

        %n,nw*
                SS*
                F
                add     *(sp)+,R

When using the stacking macros there is no place to use  the
constant  as  an index word, so that particular special case
doesn't occur.

     The constant mentioned above can actually be more  gen-
eral  than  a number.  Any quantity acceptable to the assem-
bler as an expression will do, in particular the address  of
a  static  cell,  perhaps with a numeric offset.  If _x is an
external character array, the expression `x[i+5] =  0'  will
generate the code

        mov     i,r0
        clrb    x+5(r0)

via the table entry (in the `=' part of _e_f_f_t_a_b)

        %e*,z
                F
                I'B1    #1(R)

Some machine operations place restrictions on the  registers
used.   The divide instruction, used to implement the divide
and mod operations, requires the dividend to  be  placed  in
the  odd  member of an even-odd pair; other peculiarities of
multiplication make it simplest to put the  multiplicand  in
an   odd-numbered   register.   There  is  no  theory  which
optimally accounts for this kind of requirement.  _C_e_x_p_r han-
dles  it  by  checking for a multiply, divide, or mod opera-
tion; in these cases, its argument register number is incre-
mented by one or two so that it is odd, and if the operation
was divide or mod, so that it is a member of a free even-odd
pair.   The routine which determines the number of registers
required estimates, conservatively, that at least two regis-
ters  are  required  for  a multiplication and three for the
other peculiar operators.  After the expression is compiled,
the register where the result actually ended up is returned.
(Divide and mod are actually the same operation  except  for


                     September 28, 1987


                           - 19 -


the location of the result).

     These operations are the ones which  cause  results  to
end  up  in  unexpected  places, and this possibility adds a
further level of complexity.  The simplest way  of  handling
the  problem is always to move the result to the place where
the caller expected it, but this  will  produce  unnecessary
register  moves  in  many simple cases; `a = b*c' would gen-
erate

        mov     b,r1
        mul     c,r1
        mov     r1,r0
        mov     r0,a

The next thought is used the passed-back information  as  to
where  the result landed to change the notion of the current
register.  While compiling the `='  operation  above,  which
comes from a table entry like

        %a,e
                S
                mov     R,A1

it is sufficient to redefine the meaning of `R'  after  pro-
cessing  the `S' which does the multiply.  This technique is
in fact used; the tables are written  in  such  a  way  that
correct code is produced.  The trouble is that the technique
cannot be used in general, because it invalidates the  count
of the number of registers required for an expression.  Con-
sider just `a*b + X' where X is some expression.  The  algo-
rithm assumes that the value of a*b, once computed, requires
just one register.  If there are three registers  available,
and  X  requires two registers to compute, then this expres-
sion will match a key specifying `%n,e'.  If a*b is computed
and left in register 1, then there are, contrary to expecta-
tions, no longer two registers available to compute  X,  but
only  one,  and bad code will be produced.  To guard against
this possibility, _c_e_x_p_r checks the result returned by recur-
sive calls which implement F, S and their relatives.  If the
result is not in the expected register, then the  number  of
registers  required  by  the other operand is checked; if it
can be done using those registers which  remain  even  after
making  unavailable the unexpectedly-occupied register, then
the notions of the `next register' and possibly the `current
register' are redefined.  Otherwise a register-copy instruc-
tion is produced.  A register-copy is also  always  produced
when  the  current  operator is one of those which have odd-
even requirements.

     Finally, there are a few loose-end macro operations and
facts about the tables.  The operators:

V    is used for long operations.  It  is  written  with  an


                     September 28, 1987


                           - 20 -


     address  like  a  machine  instruction; it expands into
     `adc' (add carry)  if  the  operation  is  an  additive
     operator,  `sbc' (subtract carry) if the operation is a
     subtractive operator, and disappears,  along  with  the
     rest  of  the line, otherwise.  Its purpose is to allow
     common treatment of logical operations, which  have  no
     carries, and additive and subtractive operations, which
     generate carries.

T    generates a `tst' instruction if the first  operand  of
     the  tree  does  not set the condition codes correctly.
     It is  used  with  divide  and  mod  operations,  which
     require a sign-extended 32-bit operand.  The code table
     for the  operations  contains  an  `sxt'  (sign-extend)
     instruction  to  generate  the  high-order  part of the
     dividend.

H    is analogous to the `F' and `S' macros, except that  it
     calls  for  the generation of code for the current tree
     (not one of its operands) using _r_e_g_t_a_b.  It is used  in
     _c_c_t_a_b  for  all the operators which, when executed nor-
     mally, set the condition codes  properly  according  to
     the result.  It prevents a `tst' instruction from being
     generated for constructions like `if (a+b)  ...'  since
     after  calculation  of the value of `a+b' a conditional
     branch can be written immediately.

     All of the discussion above is in  terms  of  operators
with operands.  Leaves of the expression tree (variables and
constants), however, are  peculiar  in  that  they  have  no
operands.   In  order  to  regularize  the matching process,
_c_e_x_p_r examines its operand to determine if it is a leaf;  if
so,  it  creates  a special `load' operator whose operand is
the leaf, and substitutes it for  the  argument  tree;  this
allows  the  table entry for the created operator to use the
`A1' notation to load the leaf into a register.

     Purely to save space in the tables, pieces of subtables
can  be  labelled  and referred to later.  It turns out, for
example, that rather large portions of the the _e_f_f_t_a_b  table
for  the `=' and `=+' operators are identical.  Thus `=' has
an entry

        %[move3:]
        %a,aw
        %ab,a
                IBE     A2,A1

while part of the `=+' table is

        %aw,aw
        %       [move3]

Labels  are  written  as  `%[  ...  :  ]',  before  the  key


                     September 28, 1987


                           - 21 -


specifications;  references  are  written  with `%  [ ... ]'
after the key.  Peculiarities in the implementation make  it
necessary that labels appear before references to them.

     The  example  illustrates  the  utility   of   allowing
separate keys to point to the same code string.  The assign-
ment code works properly if either the right  operand  is  a
word,  or  the left operand is a byte; but since there is no
`add byte' instruction the addition  code  has  to  be  res-
tricted to word operands.

_D_e_l_a_y_i_n_g _a_n_d _r_e_o_r_d_e_r_i_n_g

     Intertwined with the code generation routines  are  two
other,  interrelated processes.  The first, implemented by a
routine called _d_e_l_a_y, is based on the observation that naive
code generation for the expression `a = b++' would produce

        mov     b,r0
        inc     b
        mov     r0,a

The point is that the table for postfix ++ has  to  preserve
the value of _b before incrementing it; the general way to do
this is to preserve its value in  a  register.   A  cleverer
scheme would generate

        mov     b,a
        inc     b

_D_e_l_a_y is called for each expression input to _r_c_e_x_p_r, and  it
searches  for  postfix ++ and -- operators.  If one is found
applied to a variable, the tree is  patched  to  bypass  the
operator  and  compiled  as it stands; then the increment or
decrement itself is done.  The effect is as if `a = b;  b++'
had been written.  In this example, of course, the user him-
self could have done the  same  job,  but  more  complicated
examples are easily constructed, for example `switch (x++)'.
An essential restriction is that the condition codes not  be
required.   It  would be incorrect to compile `if (a++) ...'
as

        tst     a
        inc     a
        beq     ...

because the `inc' destroys the required setting of the  con-
dition codes.

     Reordering is a similar  sort  of  optimization.   Many
cases which it detects are useful mainly with register vari-
ables.  If _r is a register variable,  the  expression  `r  =
x+y' is best compiled as


                     September 28, 1987


                           - 22 -


        mov     x,r
        add     y,r

but the codes tables would produce

        mov     x,r0
        add     y,r0
        mov     r0,r

which is in fact preferred if _r is not a register.  (If _r is
not a register, the two sequences are the same size, but the
second is slightly faster.) The scheme  is  to  compile  the
expression  as  if it had been written `r = x; r =+ y'.  The
_r_e_o_r_d_e_r routine is called with a pointer to each  tree  that
_r_c_e_x_p_r is about to compile; if it has the right characteris-
tics, the `r = x' tree is constructed and passed recursively
to  _r_c_e_x_p_r; then the original tree is modified to read `r =+
y' and the calling instance of _r_c_e_x_p_r compiles that instead.
Of  course  the  whole  business is itself recursive so that
more extended forms of the same phenomenon are handled, like
`r = x + y | z'.

     Care does have to be taken  to  avoid  `optimizing'  an
expression  like  `r  =  x + r' into `r = x; r =+ r'.  It is
required that the right operand of  the  expression  on  the
right  of  the  `=' be a ', distinct from the register vari-
able.

     The second case that _r_e_o_r_d_e_r handles is expressions  of
the  form  `r = X' used as a subexpression.  Again, the code
out of the tables for `x = r = y' would be

        mov     y,r0
        mov     r0,r
        mov     r0,x

whereas if _r were a register it would be better to produce

        mov     y,r
        mov     r,x

When _r_e_o_r_d_e_r discovers that a  register  variable  is  being
assigned  to in a subexpression, it calls _r_c_e_x_p_r recursively
to compile the subexpression, then fiddles the  tree  passed
to  it  so  that the register variable itself appears as the
operand instead of the whole subexpression.  Here  care  has
to  be  taken  to avoid an infinite regress, with _r_c_e_x_p_r and
_r_e_o_r_d_e_r calling each other forever to handle assignments  to
registers.

     A third set of cases treated by _r_e_o_r_d_e_r comes  up  when
any  name,  not  necessarily  a  register,  occurs as a left
operand of an assignment operator other than `='  or  as  an


                     September 28, 1987


                           - 23 -


operand of prefix `++' or `--'.  Unless condition-code tests
are involved, when a subexpression like `(a =+ b)' is  seen,
the  assignment  is performed and the argument tree modified
so that _a is its operand; effectively `x + (y =+ z)' is com-
piled  as  `y =+ z; x + y'.  Similarly, prefix increment and
decrement are pulled  out  and  performed  first,  then  the
remainder of the expression.

     Throughout code generation, the expression optimizer is
called whenever _d_e_l_a_y or _r_e_o_r_d_e_r change the expression tree.
This allows some special cases to be  found  that  otherwise
would not be seen.

                 _A _N_e_w _I_n_p_u_t-_O_u_t_p_u_t _P_a_c_k_a_g_e

                       D. M. Ritchie

     A new package of IO routines  is  available  under  the
Unix  system.   It  was designed with the following goals in
mind.

1.   It should be similar in spirit to the earlier  Portable
     Library,  and,  to  the  extent possible, be compatible
     with it.  At the same time a few dubious design choices
     in the Portable Library will be corrected.

2.   It must be as efficient as possible, both in  time  and
     in  space, so that there will be no hesitation in using
     it no matter how critical the application.

3.   It must be simple to use, and also free  of  the  magic
     numbers  and mysterious calls the use of which mars the
     understandability  and  portability  of  many  programs
     using older packages.

4.   The interface provided  should  be  applicable  on  all
     machines,  whether  or not the programs which implement
     it are  directly  portable  to  other  systems,  or  to
     machines  other  than  the  PDP11  running a version of
     Unix.

     It is intended that this package replace  the  Portable
Library.   Although  it  is not directly compatible, as dis-
cussed below, it is sufficiently similar that a set of rela-
tively  small, inexpensive adaptor routines exist which make
it appear identical to the current Portable  Library  except
in some very obscure details.

     The most crucial difference between  this  package  and
the  Portable  Library  is  that  the current offering names
streams in terms of pointers rather  than  by  the  integers
known  as `file descriptors.' Thus, for example, the routine
which opens a named file returns  a  pointer  to  a  certain
structure  rather  than a number; the routine which reads an


                     September 28, 1987


                           - 24 -


open file takes as an argument the pointer returned from the
open call.

_G_e_n_e_r_a_l _U_s_a_g_e
Each program using the library must have the line

                        #include <stdio.h>

which defines certain macros  and  variables.   The  library
containing the routines is `/usr/lib/libS.a,' so the command
to compile is

                        cc  . . .  -lS

All names in the include file intended only for internal use
begin  with  an  underscore `_' to reduce the possibility of
collision with a user name.  The names intended to be  visi-
ble outside the package are

stdin     The name of the standard input file

stdout    The name of the standard output file

stderr    The name of the standard error file

EOF       is actually -1, and is the value returned  by  the
          read routines on end-of-file or error.

NULL      is a notation for the null  pointer,  returned  by
          pointer-valued functions to indicate an error

FILE      expands to `struct _iob' and is a useful shorthand
          when declaring pointers to streams.

BUFSIZ    is a number (viz. 512) of the size suitable for an
          IO  buffer  supplied  by  the  user.   See _s_e_t_b_u_f,
          below.

getc, getchar, putc, putchar, feof, ferror, fileno
          are  defined  as  macros.    Their   actions   are
          described  below; they are mentioned here to point
          out that it is not possible to redeclare them  and
          that  they  are  not actually functions; thus, for
          example, they may  not  have  breakpoints  set  on
          them.

     The routines in this package, like the current Portable
Library,  offer  the convenience of automatic buffer alloca-
tion and output flushing where  appropriate.   Absent,  how-
ever, is the facility of changing the default input and out-
put streams by assigning to  `cin'  and  `cout.'  The  names
`stdin,'  stdout,'  and `stderr' are in effect constants and
may not be assigned to.


                     September 28, 1987


                           - 25 -


_C_a_l_l_s
The  routines  in  the  library  are  in  nearly  one-to-one
correspondence  with  those  in  the  Portable  Library.  In
several cases the name has been changed.  This is an attempt
to  reduce  confusion.  If the attempt is judged to fail the
names may be made identical even though the arguments may be
different.   The  order  of  this list generally follows the
order used in the Portable Library document.

_F_I_L_E *_f_o_p_e_n(_f_i_l_e_n_a_m_e, _t_y_p_e)
_F_o_p_e_n opens the file and, if needed, allocates a buffer  for
it.   _F_i_l_e_n_a_m_e  is  a  character string specifying the name.
_T_y_p_e is a character string (not a single character).  It may
be `"r",' `"w",' or `"a"' to indicate intent to read, write,
or append.  The value returned is a file pointer.  If it  is
null the attempt to open failed.

_i_n_t _g_e_t_c(_i_o_p_t_r)
returns the next character from the stream named  by  _i_o_p_t_r,
which  is  a pointer to a file such as returned by _f_o_p_e_n, or
the name _s_t_d_i_n.  The integer EOF is returned on  end-of-file
or  when  an  error  occurs.   The null character is a legal
character.

_p_u_t_c(_c, _i_o_p_t_r)
_P_u_t_c writes the character _c on the output  stream  named  by
_i_o_p_t_r,  which  is  a  value  returned  from _f_o_p_e_n or perhaps
_s_t_d_o_u_t or _s_t_d_e_r_r.  The character is returned as  value,  but
EOF is returned on error.

_f_c_l_o_s_e(_i_o_p_t_r)
The file corresponding to _i_o_p_t_r is closed after any  buffers
are  emptied.  A buffer allocated by the IO system is freed.
_F_c_l_o_s_e is automatic on normal termination of the program.

_f_f_l_u_s_h(_i_o_p_t_r)
Any buffered information on the  (output)  stream  named  by
_i_o_p_t_r is written out.  Output files are normally buffered if
and only if they are  not  directed  to  the  terminal,  but
_s_t_d_e_r_r is unbuffered unless _s_e_t_b_u_f is used.

_e_x_i_t(_e_r_r_c_o_d_e)
_E_x_i_t terminates the process  and  returns  its  argument  as
status to the parent.  This is a special version of the rou-
tine which calls _f_f_l_u_s_h for each output file.  To  terminate
without flushing, use __e_x_i_t.

_f_e_o_f(_i_o_p_t_r)
returns non-zero when end-of-file has occurred on the speci-
fied input stream.

_f_e_r_r_o_r(_i_o_p_t_r)
returns non-zero when an error has occurred while reading or
writing  the named stream.  The error indication lasts until


                     September 28, 1987


                           - 26 -


the file has been closed.

_g_e_t_c_h_a_r( )
is identical to `getc(stdin)'.

_p_u_t_c_h_a_r(_c)
is identical to `putc(c, stdout)'.

_c_h_a_r *_g_e_t_s(_s)
reads characters up to a new-line from the  standard  input.
The  new-line character is replaced by a null character.  It
is the user's responsibility to make sure that the character
array _s is large enough.  _G_e_t_s returns its argument, or null
if end-of-file or error occurred.

_c_h_a_r *_f_g_e_t_s(_s, _n, _i_o_p_t_r)
reads up to _n characters from  the  stream  _i_o_p_t_r  into  the
character  pointer  _s.   The read terminates with a new-line
character.  The new-line character is placed in  the  buffer
followed  by  a null pointer.  The first argument, or a null
pointer if error or end-of-file occurred, is returned.

_p_u_t_s(_s)
writes the null-terminated string (character array) _s on the
standard  output.   A  new-line  is  appended.   No value is
returned.

_f_p_u_t_s(_s, _i_o_p_t_r)
writes the null-terminated string (character array)  on  the
stream _s.  No new-line is appended.  No value is returned.

_u_n_g_e_t_c(_c, _i_o_p_t_r)
The argument character _c is pushed back on the input  stream
named by _i_o_p_t_r.  Only one character may be pushed back.

_p_r_i_n_t_f(_f_o_r_m_a_t, _a_1, . . .)

_f_p_r_i_n_t_f(_i_o_p_t_r, _f_o_r_m_a_t, _a_1, . . .)

_s_p_r_i_n_t_f(_s, _f_o_r_m_a_t, _a_1, . . .)
_P_r_i_n_t_f writes on the standard output.  _F_p_r_i_n_t_f writes on the
named output stream.  _S_p_r_i_n_t_f puts characters in the charac-
ter array (string) named by _s.  The  specifications  are  as
usual.

_s_c_a_n_f(_f_o_r_m_a_t, _a_1, . . .)

_f_s_c_a_n_f(_i_o_p_t_r, _f_o_r_m_a_t, _a_1, . . .)

_s_s_c_a_n_f(_s, _f_o_r_m_a_t, _a_1, . . .)
_S_c_a_n_f reads from the standard input.  _F_s_c_a_n_f reads from  the
named  input stream.  _S_s_c_a_n_f reads from the character string
supplied as _s.  The specifications are identical to those of
the Portable Library.


                     September 28, 1987


                           - 27 -


_f_r_e_a_d(_p_t_r, _s_i_z_e_o_f(*_p_t_r), _n_i_t_e_m_s, _i_o_p_t_r)
writes _n_i_t_e_m_s of data beginning at _p_t_r on  file  _i_o_p_t_r.   It
behaves  identically  to  the  Portable Library's _c_r_e_a_d.  No
advance  notification  that  binary  IO  is  being  done  is
required;   when,   for   portability  reasons,  it  becomes
required, it will be done by adding an additional  character
to the mode-string on the fopen call.

_f_w_r_i_t_e(_p_t_r, _s_i_z_e_o_f(*_p_t_r), _n_i_t_e_m_s, _i_o_p_t_r)
Like _f_r_e_a_d, but in the other direction.

_r_e_w_i_n_d(_i_o_p_t_r)
rewinds the stream named by _i_o_p_t_r.  It is  not  very  useful
except  on  input, since a rewound output file is still open
only for output.

_s_y_s_t_e_m(_s_t_r_i_n_g)

_a_t_o_f(_s)

_t_m_p_n_a_m(_s)

_a_b_o_r_t(_c_o_d_e)

_i_n_t_s_s( )

_c_f_r_e_e(_p_t_r)

_w_d_l_e_n_g( )
are  available  with  specifications  identical   to   those
described for the Portable Library.

_c_h_a_r *_c_a_l_l_o_c(_n, _s_i_z_e_o_f(_o_b_j_e_c_t))
returns null when no  space  is  available.   The  space  is
guaranteed to be 0.

_f_t_o_a
is not implemented but there are plausible alternatives.

_n_a_r_g_s( )
is not implemented.

_g_e_t_w(_i_o_p_t_r)
returns the next word from the input stream named by  _i_o_p_t_r.
EOF  is  returned  on end-of-file or error, but since this a
perfectly good integer _f_e_o_f and _f_e_r_r_o_r should be used.

_p_u_t_w(_w, _i_o_p_t_r)
writes the integer _w on the named output stream.

_s_e_t_b_u_f(_i_o_p_t_r, _b_u_f)
_S_e_t_b_u_f may be used after a stream has been opened but before
IO  has  started.  If _b_u_f is null, the stream will be unbuf-
fered.  Otherwise the buffer supplied will be used.  It is a


                     September 28, 1987


                           - 28 -


character array of sufficient size:

        char    buf[BUFSIZ];


_f_i_l_e_n_o(_i_o_p_t_r)
returns the integer  file  descriptor  associated  with  the
file.

     Several additional routines are available.

_f_s_e_e_k(_i_o_p_t_r, _o_f_f_s_e_t, _p_t_r_n_a_m_e)
The location of the next byte in the stream named  by  _i_o_p_t_r
is  adjusted.   _O_f_f_s_e_t  is a long integer.  If _p_t_r_n_a_m_e is 0,
the offset is measured from the beginning of  the  file;  if
_p_t_r_n_a_m_e  is  1, the offset is measured from the current read
or write pointer; if _p_t_r_n_a_m_e is 2, the  offset  is  measured
from the end of the file.  The routine accounts properly for
any buffering.

_l_o_n_g _f_t_e_l_l(_i_o_p)
The byte offset, measured from the beginning  of  the  file,
associated with the named stream is returned.  Any buffering
is properly accounted for.

_g_e_t_p_w(_u_i_d, _b_u_f)
The password file is searched for the given integer user ID.
If an appropriate line is found, it is copied into the char-
acter array _b_u_f, and 0 is returned.  If  no  line  is  found
corresponding to the user ID then 1 is returned.

_s_t_r_c_a_t(_s_1, _s_2)
_S_1 and _s_2 are character pointers.  The end  (null  byte)  of
the  _s_1  string  is  found  and  _s_2 is copied to _s_1 starting
there.  The space pointed to by _s_1 must be large enough.

_s_t_r_c_m_p(_s_1, _s_2)
The character strings _s_1 and _s_2 are compared.  The result is
positive, zero, or negative according as _s_1 is greater than,
equal to, or less than _s_2 in ASCII collating sequence.

_s_t_r_c_p_y(_s_1, _s_2)
The null-terminated character string _s_2  is  copied  to  the
location pointed to by _s_1.

_s_t_r_l_e_n(_s)
The number of bytes in s up to a null byte is  returned.   _S
is a character pointer.

_g_c_v_t(_n_u_m, _n_d_i_g, _b_u_f)
_N_u_m is a floating  or  double  quantity.   _N_d_i_g  significant
digits  are converted to ASCII and placed into the character
array _b_u_f.  The conversion is  in  Fortran  _e  or  _f  style,
whichever yields the shorter string.  Insignificant trailing


                     September 28, 1987


                           - 29 -


zeros are eliminated.
                         C Changes

1.  Long integers

The compiler implements 32-bit integers.  The associated
type keyword is `long'.  The word can act rather like an
adjective in that `long int' means a 32-bit integer and
`long float' means the same as `double.' But plain `long' is
a long integer.  Essentially all operations on longs are
implemented except that assignment-type operators do not
have values, so l1+(l2=+l3) won't work.  Neither will l1 =
l2 = 0.

Long constants are written with a terminating `l' or `L'.
E.g. "123L" or "0177777777L" or "0X56789abcdL".  The latter
is a hex constant, which could also have been short; it is
marked by starting with "0X".  Every fixed decimal constant
larger than 32767 is taken to be long, and so are octal or
hex constants larger than 0177777 (0Xffff, or 0xFFFF if you
like).  A warning is given in such a case since this is
actually an incompatibility with the older compiler.  Where
the constant is just used as an initializer or assigned to
something it doesn't matter.  If it is passed to a subrou-
tine then the routine will not get what it expected.

When a short and a long integer are operands of an arith-
metic operator, the short is converted to long (with sign
extension).  This is true also when a short is assigned to a
long.  When a long is assigned to a short integer it is
truncated at the high order end with no notice of possible
loss of significant digits.  This is true as well when a
long is added to a pointer (which includes its usage as a
subscript).  The conversion rules for expressions involving
doubles and floats mixed with longs are the same as those
for short integers, _m_u_t_a_t_i_s _m_u_t_a_n_d_i_s.

A point to note is that constant expressions involving longs
are not evaluated at compile time, and may not be used where
constants are expected.  Thus

     long x {5000L*5000L};

is illegal;

     long x {5000*5000};

is legal but wrong because the high-order part is lost; but
both

     long x 25000000L;

and


                     September 28, 1987


                           - 30 -


     long x 25.e6;

are correct and have the same meaning because the double
constant is converted to long at compile time.

2.  Unsigned integers

A new fundamental data type with keyword `unsigned,' is
available.  It may be used alone:

        unsigned u;

or as an adjective with `int'

        unsigned int u;

with the same meaning.  There are not yet (or possibly ever)
unsigned longs or chars.  The meaning of an unsigned vari-
able is that of an integer modulo 2^n, where n is 16 on the
PDP-11.  All operators whose operands are unsigned produce
results consistent with this interpretation except division
and remainder where the divisor is larger than 32767; then
the result is incorrect.  The dividend in an unsigned divi-
sion may however have any value (i.e.  up to 65535) with
correct results.  Right shifts of unsigned quantities are
guaranteed to be logical shifts.

When an ordinary integer and an unsigned integer are com-
bined then the ordinary integer is mapped into an integer
mod 2^16 and the result is unsigned.  Thus, for example `u =
-1' results in assigning 65535 to u.  This is mathematically
reasonable, and also happens to involve no run-time over-
head.

When an unsigned integer is assigned to a plain integer, an
(undiagnosed) overflow occurs when the unsigned integer
exceeds 2^15-1.

It is intended that unsigned integers be used in contexts
where previously character pointers were used (artificially
and nonportably) to represent unsigned integers.

3.  Block structure.

A sequence of declarations may now appear at the beginning
of any compound statement in {}.  The variables declared
thereby are local to the compound statement.  Any declara-
tions of the same name existing before the block was entered
are pushed down for the duration of the block.  Just as in
functions, as before, auto variables disappear and lose
their values when the block is left; static variables retain
their values.  Also according to the same rules as for the
declarations previously allowed at the start of functions,
if no storage class is mentioned in a declaration the


                     September 28, 1987


                           - 31 -


default is automatic.

Implementation of inner-block declarations is such that
there is no run-time cost associated with using them.

4.  Initialization (part 1)

This compiler properly handles initialization of structures
so the construction

     struct { char name[8]; char type; float val; } x
          { "abc", 'a', 123.4 };

compiles correctly.  In particular it is recognized that the
string is supposed to fill an 8-character array, the `a'
goes into a character, and that the 123.4 must be rounded
and placed in a single-precision cell.  Structures of
arrays, arrays of structures, and the like all work; a more
formal description of what is done follows.

<initializer> ::= <element>

<element> ::= <expression> | <element> , <element> |
          { <element> } | { <element> , }

An element is an expression or a comma-separated sequence of
elements possibly enclosed in braces.  In a brace-enclosed
sequence, a comma is optional after the last element.  This
very ambiguous definition is parsed as described below.
"Expression" must of course be a constant expression within
the previous meaning of the Act.

An initializer for a non-structured scalar is an element
with exactly one expression in it.

An "aggregate" is a structure or an array.  If the initial-
izer for an aggregate begins with a left brace, then the
succeeding comma-separated sequence of elements initialize
the members of the aggregate.  It is erroneous for the
number of members in the sequence to exceed the number of
elements in the aggregate.  If the sequence has too few
members the aggregate is padded.

If the initializer for an aggregate does not begin with a
left brace, then the members of the aggregate are initial-
ized with successive elements from the succeeding comma-
separated sequence.  If the sequence terminates before the
aggregate is filled the aggregate is padded.

The "top level" initializer is the object which initializes
an external object itself, as opposed to one of its members.
The top level initializer for an aggregate must begin with a
left brace.


                     September 28, 1987


                           - 32 -


If the top-level object being initialized is an array and if
its size is omitted in the declaration, e.g. "int a[]", then
the size is calculated from the number of elements which
initialized it.

Short of complete assimilation of this description, there
are two simple approaches to the initialization of compli-
cated objects.  First, observe that it is always legal to
initialize any object with a comma-separated sequence of
expressions.  The members of every structure and array are
stored in a specified order, so the expressions which ini-
tialize these members may if desired be laid out in a row to
successively, and recursively, initialize the members.

Alternatively, the sequences of expressions which initialize
arrays or structures may uniformly be enclosed in braces.

5.  Initialization (part 2)

Declarations, whether external, at the head of functions, or
in inner blocks may have initializations whose syntax is the
same as previous external declarations with initializations.
The only restrictions are that automatic structures and
arrays may not be initialized (they can't be assigned
either); nor, for the moment at least, may external vari-
ables when declared inside a function.

The declarations and initializations should be thought of as
occurring in lexical order so that forward references in
initializations are unlikely to work.  E.g.,

        { int a a;
          int b c;
          int c 5;
          ...
        }

Here a is initialized by itself (and its value is thus unde-
fined); b is initialized with the old value of c (which is
either undefined or any c declared in an outer block).

6.  Bit fields

A declarator inside a structure may have the form

     <declarator> : <constant>

which specifies that the object declared is stored in a
field the number of bits in which is specified by the con-
stant.  If several such things are stacked up next to each
other then the compiler allocates the fields from right to
left, going to the next word when the new field will not
fit.  The declarator may also have the form


                     September 28, 1987


                           - 33 -


     : <constant>

which allocates an unnamed field to simplify accurate model-
ling of things like hardware formats where there are unused
fields.  Finally,

     : 0

means to force the next field to start on a word boundary.

The types of bit fields can be only "int" or "char".  The
only difference between the two is in the alignment and
length restrictions: no int field can be longer than 16
bits, nor any char longer than 8 bits.  If a char field will
not fit into the current character, then it is moved up to
the next character boundary.

Both int and char fields are taken to be unsigned (non-
negative) integers.

Bit-field variables are not quite full-class citizens.
Although most operators can be applied to them, including
assignment operators, they do not have addresses (i.e. there
are no bit pointers) so the unary & operator cannot be
applied to them.  For essentially this reason there are no
arrays of bit field variables.

There are three twoes in the implementation: addition (=+)
applied to fields can result in an overflow into the next
field; it is not possible to initialize bit fields.

7.  Macro preprocessor

The proprocessor handles `define' statements with formal
arguments.  The line

     #define macro(a1,...,an) ...a1...an...

is recognized by the presence of a left parenthesis follow-
ing the defined name.  When the form

     macro(b1,...,bn)

is recognized in normal C program text, it is replaced by
the definition, with the corresponding _b_i actual argument
string substituted for the corresponding _a_i formal argu-
ments.  Both actual and formal arguments are separated by
commas not included in parentheses; the formal arguments
have the syntax of names.

Macro expansions are no longer surrounded by spaces.  Lines
in which a replacement has taken place are rescanned until
no macros remain.


                     September 28, 1987


                           - 34 -


The preprocessor has a rudimentary conditional facility.  A
line of the form

     #ifdef name

is ignored if `name' is defined to the preprocessor (i.e.
was the subject of a `define' line).  If name is not defined
then all lines through a line of the form

     #endif

are ignored.  A corresponding form is

     #ifndef name
     ...
     #endif

which ignores the intervening lines unless `name' is
defined.  The name `unix' is predefined and replaced by
itself to aid writers of C programs which are expected to be
transported to other machines with C compilers.

In connection with this, there is a new option to the cc
command:

     cc -Dname

which causes `name' to be defined to the preprocessor (and
replaced by itself).  This can be used together with condi-
tional preprocessor statements to select variant versions of
a program at compile time.

The previous two facilities (macros with arguments, condi-
tional compilation) were actually available in the 6th Edi-
tion system, but undocumented.  New in this release of the
cc command is the ability to nest `include' files.  Prepro-
cessor include lines may have the new form

     #include <file>

where the angle brackets replace double quotes.  In this
case, the file name is prepended with a standard prefix,
namely `/usr/include'.  In is intended that commonly-used
include files be placed in this directory; the convention
reduces the dependence on system-specific naming conven-
tions.  The standard prefix can be replaced by the cc com-
mand option `-I':

     cc -Iotherdirectory

8.  Registers

A formal argument may be given the storage class `register.'
When this occurs the save sequence copies it from the place


                     September 28, 1987


                           - 35 -


the caller left it into a fast register; all usual restric-
tions on its use are the same as for ordinary register vari-
ables.

Now any variable inside a function may be declared `regis-
ter;' if the type is unsuitable, or if there are more than
three register declarations, then the compiler makes it
`auto' instead.  The restriction that the & operator may not
be applied to a register remains.

9.  Mode declarations

A declaration of the form

     typedef_______ type-specifier declarator ;_

makes the name given in the declarator into the equivalent
of a keyword specifying the type which the name would have
in an ordinary declaration.  Thus

     typedef int *iptr;

makes `iptr' usable in declarations of pointers to integers;
subsequently the declarations

     iptr ip;
     int *ip;

would mean the same thing.  Type names introduced in this
way obey the same scope rules as ordinary variables.  The
facility is new, experimental, and probably buggy.

10. Restrictions

The compiler is somewhat stickier about some constructions
that used to be accepted.

One difference is that external declarations made inside
functions are remembered to the end of the file, that is
even past the end of the function.  The most frequent prob-
lem that this causes is that implicit declaration of a func-
tion as an integer in one routine, and subsequent explicit
declaration of it as another type, is not allowed.  This
turned out to affect several source programs distributed
with the system.

It is now required that all forward references to labels
inside a function be the subject of a `goto.' This has
turned out to affect mainly people who pass a label to the
routine `setexit.' In fact a routine is supposed to be
passed here, and why a label worked I do not know.

In general this compiler makes it more difficult to use
label variables.  Think of this as a contribution to


                     September 28, 1987


                           - 36 -


structured programming.

The compiler now checks multiple declarations of the same
name more carefully for consistency.  It used to be possible
to declare the same name to be a pointer to different struc-
tures; this is caught.  So too are declarations of the same
array as having different sizes.  The exception is that
array declarations with empty brackets may be used in con-
junction with a declaration with a specified size.  Thus

     int a[];      int a[50];

is acceptable (in either order).

An external array all of whose definitions involve empty
brackets is diagnosed as `undefined' by the loader; it used
to be taken as having 1 element.


                     September 28, 1987