next up previous contents
Next: Graphics formats Up: Common Formats for Printed Previous: Microsoft Word

Other formats and conversions.

HTML is the format used in Web Pages. (Suffices .htm, .html). It is a subset of the SGML format (Suffix .sgml), the Standard Generalized Markup Language, a document standard written to enhance the portability of documents over the Internet. HTML, SGML, and a recent addition to the family, XML (eXtensible Markup Language), are all logical markup languages, that are meant for the logical organization of document information, rather than typesetting. This means that these documents can be generated and parsed by computer programs such as Web-crawling robots and database programs. It also means that each computer user will display them differently, and print them differently. ASCII is the term for a simple text file (often .txt). Unix manual pages use their own format (nroff) and on the Web you can find utilities for converting manual files into other formats. (At http://www.freshmeat.net). Note this: LATEX  and HTML files are both text-based file formats, while PDF, DVI, et cetera are binary formats. This means that they are edited by hand usually using emacs. To convert among all these formats you can use Frame Maker or Applix, but also the utility latex2html (add infoagents).


next up previous contents
Next: Graphics formats Up: Common Formats for Printed Previous: Microsoft Word
Omri Schwarz
1999-09-12