Next: Graphics formats
Up: Common Formats for Printed
Previous: Microsoft Word
HTML is the format used in Web Pages. (Suffices .htm, .html).
It is a subset of the SGML format (Suffix .sgml),
the Standard Generalized Markup Language, a document standard written to enhance
the portability of documents over the Internet. HTML, SGML, and a recent addition to
the family, XML (eXtensible Markup Language), are all logical markup languages,
that are meant for the logical organization of document information,
rather than typesetting. This means that these documents can be generated and parsed
by computer programs such as Web-crawling robots and database programs. It also means
that each computer user will display them differently, and print them differently.
ASCII is the term for a simple text file (often .txt). Unix manual pages
use their own format (nroff) and on the Web you can find utilities for converting
manual files into other formats. (At http://www.freshmeat.net).
Note this: LATEX and HTML
files are both text-based file formats, while PDF, DVI, et cetera are
binary formats. This means that they are edited by hand usually using emacs.
To convert among all these formats you can use Frame Maker or Applix, but also the
utility latex2html (add infoagents).
Next: Graphics formats
Up: Common Formats for Printed
Previous: Microsoft Word
Omri Schwarz
1999-09-12