Generated content, elements, DOM and entities

StoneCypher
30 Mar 2007

Several of the requests that have been made in forums which are under investigation for future releases would be implementable if there was a mechanism for injecting actual tags, rather than flat text, through the content: mechanism. Most of these tricks would revolve around @page { @location {} } schemes.

Of recent note, for example, is the user who wanted a mechanism akin to an old version of Prince's watermarking mechanism, in order to apply a logo to each page. I provided him with a relatively ugly hack revolving around the misappropriation of the page background image, which is servicable in his limited context; however, were it the case that a person wanted two such images per page - say, a logo for the company in one corner and for the department in another - the hack would cease to be applicable. (It is worth noting that CSS3 stacked properties would allow the arbitrary extension of this hack, but that is not a particularly desirable answer - though that on its own is a desirable mechanism for other reasons, such as single-tag implementation of the sliding doors pattern.)

Now, here semantic people get into a remarkably bitter debate about whether the ability to inject tags through :content, :before and :after is a legitimate ability for a stylesheet system. I note that you use vendor prefix codes and support xpath, so it seems likely that you are intensely standards focussed. This presents a difficulty in finding an appropriate resolution system. This is exacerbated by all relevant standards' complete commentary abstinance on the issue (no part of the CSS3 specification currently makes any attempt to resolve the issue.)

I have seen people using foreign references and XPath references to pull in DOM structured content as formatting cue. Whereas this provides an appropriate mechanism for the stylesheet to simply denote the mechanism for injecting new content, I believe this is overengineered, and necessitates an unnessecary subdivision of a document, one which for complex documents could easily become inordinately difficult to contain without software support.

I have a compromise which has mollified essentially everyone I've thrown it at, which I would like for you to consider. Whereas it has no support from standards, it also has no condemnation from standards; it relies entirely on the particular interpretation of mechanisms which are well defined and well accepted. It is extremely unlikely that any future codification of CSS would break this mechanism, though it is quite likely that a future revision will either make this mechanism seem archaic and silly, or make it absolutely vital.

The XML standard allows for entities to contain DOM structured elements, and the CSS standard makes zero mention of the interpretation of strings, particularly with respect to entities, within its content fields. Furthermore, I have a hard time imagining a developer objecting to the interpretation of entities within a content field, and browsers are divided on the issue, which gives you good company to argue with (KHTML / Konqueror / Khameleon / Safari, Omniweb and Opera all believe that entities within generated content should be honored; only KHTML/etc's implementation seems to be strong enough to allow XML's entity rules to prevail.)

If you are willing to honor XML's entity rules from within CSS, then the problem becomes a trivial non-issue, wherein the question of generated repeating content becomes one of the stylesheet referring to a prefabricated entity (or two or six - entities may also legally be fractions of tags - you may define one entity as "<tab", the other as "le>", stick them together and expect them to work) which is defined in the DTD, the whole semantic part of this is kept out of the CSS and in the DTD where it belongs, and everybody's happy.

Furthermore, though I'm guessing at the nature of your software backend, it seems highly likely that this would not be particularly difficult to implement. This would require a (very short) pass through the source document(s) to get the DTDs involved, then an early parse of the DTDs, then the expansion application of entities to the content rules in the CSS, before the CSS was applied to the document. Most likely the only difficult part involved will be the reinterpretation of the document required after the injection of a DOM structure.

I remain unaware of a significant downside to this approach, other than the common "only some browsers do it that way" bit with which pretty much anything needs to wrestle. Quite a few header and footer issues, several border placement issues and so on could be reduced to what another language might call macros (yes, I know macro has a different meaning in CSS, I mean what a C programmer would call a macro.)

Indeed, were such a mechanism supported, advanced PrinceXML users would have a reasonable way to provide pre-fabricated layout toolkits for the benefit of newer users.

I look forward to your perspective on the matter.

John Haugeland is http://fullof.bs/

mikeday
31 Mar 2007

Quick correction: Prince does not support XPath, although it does support XPointer on XML Inclusions.

I think that XML entities might be a fairly tricky way of adding content to documents; we would like to explore some method of lightweight transformations or document augmentation however.

DanielPharos
8 Apr 2014

(Sorry for replying to such an old topic)

I'd like to know what (if any) the status on this kind of mechanism is. In my case, I've got an <img>-tag in a chapter title, and I'm using the CSS "content:" with "target-content" for my table of content, but this makes the tag completely disappear in the ToC entry. I can manually work-around it (by adding this entry to the ToC by hand), but of course, some official (even if non-standard) way of doing this would be very nice.

dauwhe
8 Apr 2014

@DanielPharos: The current draft of CSS GCPM has a target-text() function, but it returns the text value of the element identified by the URL fragment, not the structure.

Could you share an example (even just screen shots) of your chapter title page and toc? I've not heard such a use case before.

@StoneCypher: I'm working on proposals to copy or move elements to page margin boxes. See http://dev.w3.org/csswg/css-gcpm/#alternative-syntax-running.

Thanks,

Dave

DanielPharos
8 Apr 2014

Sure! By the way, I'm using PrinceXML 9.0r4 on Mac OS 10.8, in case that matters.

I've attached a screenshot, and here's the relevant pieces of XHTML and CSS:

<!DOCTYPE html    PUBLIC "-//W3C//DTD XHTML 1.1 plus MathML 2.0//EN"
           "http://www.w3.org/Math/DTD/mathml2/xhtml-math11-f.dtd">
<HTML xmlns="http://www.w3.org/1999/xhtml" lang="en" xml:lang="en">
...
<H1>Table of contents</H1>
<div id="navig">
<div><a class="TOC_ChapterLink" href="#chapterIntroduction">?</a></div>
...
<div><a class="TOC_SubSectionLink" href="#subsectionParVarPtjmin">?</a></div>
...
<NAV class="subsection" id="subsecParVarPtjmin"><H3 class="subsection" id="subsectionParVarPtjmin">ptjmin: Minimum <img src="images/Formula/pT.svg" style="vertical-align: middle; margin-bottom: -0.1em; height: 0.7em;"/> for partons</H3>



CSS:

#navig a.TOC_ChapterLink::before
{
	content: target-counter(attr(href), chapter-num) ": ";
}
#navig a
{
	content: target-content(attr(href));
}
#navig a::after
{
	content: leader(" .") target-counter(attr(href), page);
}
H3.subsection::before
{
	content: counter(chapter-num) "." counter(section-num) "." counter(subsection-num) " - ";
}
/* This shouldn't matter, but included for completeness: */
H3.subsection
{
	string-set: subsection-name content();
}

The reason I'm using the image is because the layouting of MathML isn't good enough (yet), so my formulas wouldn't look too well. I've attached the used image too.

PrinceXMLTagContentIssue.png‎ 27.8 kB
pT.svg‎ 5.4 kB

Forum › How do I...?

Generated content, elements, DOM and entities