The Web was originally created to serve HTML documents. Now it is used to serve all sorts of documents as well as data of dirrent kinds. Nevertheless, HTML is still the main document type delivered over the Web Go has basic mechanisms for parsing HTML documents, which are covered in this chapter
The Web was originally created to serve HTML documents. Now it is used to serve all sorts of documents as well as data of dirrent kinds. Nevertheless, HTML is still the main document type delivered over the Web
HTML has been through a large number of versions, and HTML 5 is currently under development. There have also been many "vendor" versions of HTML, introducing tags that never made it into standards.
HTML is simple enough to be edited by hand. Consequently, many HTML documents are "ill formed", not following the syntax of the language. HTML parsers generally are not very strict, and will accept many "illegal" documents.
There wasn't much in earlier versions of Go about handling HTML documents -
basically, just a tokenizer. The incomplete nature of the package has led
to its removal for Go 1. It can still be found in the exp
(experimental) package if you really need it. No doubt some improved
form will become available in a later version of Go, and then it will be
added back into this book.
There is limited support for HTML in the XML package, discussed in the next chapter.
There isn't anything to this package at present as it is still under development.
If you like this book, please contribute using Flattr
or donate using PayPal