Help:Manual Importhtml

Table of Contents

Notes on Importing HTML into Scribus Scribus has an HTML importer which can import clean, well formed HTML and retain much of the layout and and formatting, provided the formatting or styling is basic HTML in the HTML markup, not via css style sheets. CSS support will come in the future. So what kind of HTML is supported and how does it work ? Upon import, the importer will create paragraph styles which correspond to the html markup. Bold, Italics and monospace text and alignment are also supported. Below is a listing of the HTML markup supported - both upper and lower case tags.  body, div, a - Text must be within the &lt;body&gt; tags. p and br - Corresponding to paragraph and line breaks. H1 to H4 - Correspond to Heading Sizes 1 to 4. ol,ul,li - Corresponding to ordered or unordered lists. pre and code - Corresponding to preformatted text and source code listings. These will be converted to text using the fixed pitch font Courier. www. style web links are converted to text with blue coloring to highlight them in the same manner as most web browsers do in their default settings. b, u, i, em, strong,sub.sup,del,u - Text formatting is converted to the corresponding font styles. Note, your default font should have all of these variants available to Scribus.  Hence the following section of html will display and create paragraph styles in Scribus as demonstrated below. &lt;?xml version="1.0" encoding="utf-8"?&gt; &lt;!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd"&gt; &lt;html xmlns="http://www.w3.org/1999/xhtml"&gt; &lt;head&gt; &lt;title&gt;&lt;/title&gt; &lt;meta http-equiv="Content-Type" content="text/html; charset=utf-8" /&gt; &lt;/head&gt; &lt;body&gt; &lt;h1&gt;H1 Text&lt;/h1&gt; &lt;h2&gt;H2 Text&lt;/h2&gt; &lt;h3&gt;H3 Text&lt;/h3&gt; &lt;h4&gt;H4 Text&lt;/h4&gt; &lt;ol&gt; &lt;li&gt;Ordered List Item 1 &lt;/li&gt; &lt;li&gt;Ordered List Item 2  &lt;/li&gt;  &lt;/ol&gt;  &lt;ul&gt; &lt;li&gt;Un-Ordered List Item 1  &lt;/li&gt;  &lt;li&gt;Un-Ordered List Item 2  &lt;/li&gt;  &lt;/ul&gt;  &lt;code&gt;code listings&lt;/code&gt;  &lt;p&gt;&lt;b&gt;Bold Paragraph Style&lt;/b&gt;&lt;/p&gt;  &lt;p&gt;&lt;i&gt;Italic Paragraph Style&lt;/i&gt;&lt;/p&gt;  &lt;p align="center"&gt;Centered Text&lt;/p&gt; &lt;/body&gt; &lt;/html&gt;  Below the imported styles from the file above. Below is the imported text displayed on the canvas:  Not all applications export HTML with high fidelity to the W3C specifications. You can use htmltidy to clean up and make conformant HTML text you need to import. See : http://w3c.org