blackfoundrycom / variable-components-spec Goto Github PK

View Code? Open in Web Editor NEW

9.0 9.0 0.0 50 KB

variable-components-spec's Introduction

Variable Components

A proposal for an add-on to OpenType 1.8 by Black[Foundry]

Superceded by github.com/harfbuzz/boring-expansion-spec in 2023

Authors:

Just van Rossum (@justvanrossum)
Jérémie Hornus (@JeremieHornus)

Terminology
What are Variable Components?
- Use cases
Proposal overview
- Extend the OpenType format
Format overview
- The Component data structure
  - Transformation
  - Designspace location
How to process VarC data?
Format details
- Notes on precision
- Field formats
New axis flag for the vartable
Notes on non-linear interpolation
Comparison to the COLRv1 proposal
Prototype Implementation
Prior Art

Terminology

designspace: equivalent to the term “variation space”
designspace location: a specific location in a designspace
composite: a glyph that is composed of references to other glyphs
component: a reference to another glyph
base glyph: the glyph referenced by a component

What are Variable Components?

TrueType has had the capability to form composite glyphs since its inception. A composite glyph references other glyphs as “components”. This is a simple way to save data storage, for glyphs that can be composed of other glyphs, such as diacritics. Components can be arbitrarily positioned in the composite glyph, and can be scaled and rotated and skewed if needed.

“Variable Components”, as described in this document, add further parameters to customize the appearance of components in the composite glyph.

With OpenType 1.8, “variations” were added to the format, allowing for live interpolation between (say) regular and bold. These variations are global to the font: they are controlled by the user for the font as a whole. Such a font is no longer static, and the end user can navigate a designspace along dimensions (axes) defined by the font. The chosen variation is defined by the designspace location, as a set of coordinates in the designspace.

“Variable Components” add the possibility to place a variable glyph in a composite glyph, specifying the interpolation settings (the designspace location) for that single occurrence. A single glyph may define its own design space, for composites to use as they see fit.

Use cases

Some design tools already implement a methodology like Variable Components (for example “Smart Components” in Glyphs.app). Users find it often a more efficient way to design certain glyphs. Upon export as TTF, such components have to be converted to traditional outlines.

However, a lot of space could be saved if the Variable Component information would be stored in the font, instead of traditional outlines: a component reference will typically take up less space than a discrete outline.

This is especially true for CJK fonts, which tend to contain very large numbers of glyphs, many of which can be composed of variations of more basic glyphs.

More generally, fonts often contain glyphs that can be seen as variations of regular glyphs, for example superior and inferior numerals and small caps. It can be beneficial to represent these variations as local instances using Variable Components.

Proposal overview

The proposal presented here was informed by the following priorities:

Only add to the OpenType 1.8 format, if possible
Avoid changing existing OpenType 1.8 tables as much as possible
Use existing data structures whenever possible
Build on TrueType-flavored data
Use the existing mechanisms available in OpenType 1.8 as much as possible:
- Use glyf table composites/components
- Use gvar for variations
Make it easy for existing implementations to adapt
Store the Variable Component data in a compact form, to maximize space saving

The first thing we need to pin down is how to do “local designspaces”. How does a glyph define its own designspace, to be used by composites?

Here are some of the insights that led to the current design, in order:

A component has full control over the designspace location of the component’s base glyph. It can even override a global axis value.
The global designspace location can affect the composite glyph, but to achieve the design goal of this propasal, it does not need to affect the designspace location of the base glyph directly. (This is still open for discussion, see #1)
A “base glyph” is just a regular glyf-based glyph, using gvar for variations, but it needs to be able to use axes that are not user-controllable.
We can use fvar axes, but we need to be able to flag an axis as “This axis is for variable component use only, do not expose it to the user, ever, at all”. This is perhaps more strict than the definition of the existing “hidden” axis flag, and we need to establish whether a new axis flag may be needed.
The total number of axes that can be used by a font (as specified in the fvar table) does not have an unreasonable limit per se (65536), but it is not without cost: in some places – for example in gvar variation tuples or VarStore regions – there is a value specified for every single axis in the font, even if that axis does not participate in a certain variation. This is especially relevant for gvar, as there can be many tuple variations (many glyphs × many variations per glyph), so adding even a single axis to a font can have a significant impact on the file size. So: let’s not use more axes than strictly necessary.
A Variable Component axis is not exposed to the user, and there is no need for “user coordinates”: the composite will only ever use “normalized coordinates” to specify a designspace location. Also: we don’t consider avar-like functionality to be necessary here.
A Variable Component axis is internally always referenced by its axis index. The “axis tag” is completely irrelevant. (Axis tags are only used for user interaction, and are not referenced anywhere in the font outside of the fvar table.)
The previous points lead to the conclusion that a single axis can be (re)used for different purposes by different base glyphs. The axis identity is no longer attached to a function that is meaningful for the end user, or any specific meaning at all. For example, base glyph X may use axis #2 to implement “stretching”, but base glyph Y may use the same axis #2 to implement “bending”. The meaning of an axis is completely local to the base glyph. Each component specifies the local designspace location for its base glyph.
Concluding, the local designspace for a base glyph does not need any more information than what we already have: it is completely defined by its variations’ locations in the gvar table.

Extending the OpenType format

To make Variable Components work, the only thing that is missing from OpenType 1.8 is the capability to store some additional information for each component of composite glyphs. The core of this proposal is to add a single new table, called VarC, that will provide a space for all new information.

A Variable Component reference needs the following information:

The base glyph ID. This specifies which glyph we are referencing.
A transformation (offset, scale, rotation, etc.)
A designspace location
Variations for the transformation and the designspace location, so the composite itself can become a variable glyph (whether as a “normal” glyph, or referenced as a Variable Component by another glyph)

We use the composites/components mechanism from the glyf table, so some of these values are already taken care of: the base glyph ID and the offset.

Components in the glyf table can optionally specify a scale value, or x/y scale values, or a 2×2 transformation matrix, but we chose not to use these for several reasons:

Scale factors (and matrix values) are Fixed2Dot14, meaning they are limited to the range -2.0..+2.0, which is a problem for some use cases.
gvar only supports interpolation of the component offset values, not of the scale values or the matrix.
To interpolate 2×2 transformation matrices in a useful way is non-obvious and non-trivial, even when decomposing into scale, rotations and skew values.

Summarizing:

For the base glyph ID, the component offset and its variations, we rely on glyf + gvar. Additional transformation values (scale, rotation, etc.) and its variations will be stored in VarC
The component designspace location and its variations will also be stored in VarC

Base glyphs are totally ordinary glyf + gvar glyphs, but can also be composites themselves, using Variable Components, so we fully embrace the recursive nature of TrueType components.

Format overview

High level structure of the VarC table:

name	description
Version	version field, initially `0x00010000`
numGlyphs	the number of glyphs
GlyphData[numGlyphs]	array of glyph data
VarStore	variation data

GlyphData is an array of offsets to Glyph subtables, indexed by GlyphID.

numGlyphs must be less than or equal to the numGlyphs field in the maxp table.

A Glyph subtable is an array of variable length Component data.

The VarC table depends on the glyf table: to parse a VarC Glyph, one needs to know the number of components from the glyf table. That number is not duplicated in VarC.

The Component data structure

The Component data structure stores the additional transformation fields, the designspace location for the components, and indices into the VarStore for each value that needs variations.

Transformation

The transformation data consists of individual optional fields, which can be used to construct a transformation matrix.

Transformation fields:

name	default value
Rotation	0
ScaleX	1
ScaleY	1
SkewX	0
SkewY	0
TCenterX	0
TCenterY	0

The TCenterX and TCenterY values represent the “center of transformation”. This is separate from the component offset as stored in the glyf table.

Details of how to build a transformation matrix, as pseudo-Python/fontTools code, where (X, Y) is the component offset from the glyf table:

# Using fontTools.misc.transform.Transform
t = Transform()  # Identity
t = t.translate(X + TCenterX, Y + TCenterY)
t = t.rotate(Rotation)
t = t.scale(ScaleX, ScaleY)
t = t.skew(SkewX, SkewY)
t = t.translate(-TCenterX, -TCenterX)

The transformation fields are stored as individual fields, and are interpolated as individual fields. If the client needs a transformation matrix, then this matrix needs to be constructed after interpolation.

Rationale for using a transformation center, using Rotation as an example:

Rotation by default happens around the origin of the component
For some cases this may be good enough, as the base glyph can be designed this way
However, in other cases components may need to determine the rotation center locally, depending on how the component is used. Imagine a base glyph that represents a horizontal bar. In one glyph, this bar should be rotated using the left side as the center, and in another, it should be rotated using the right side as the center.
This all wouldn’t make a difference if it wasn’t for interpolation: it’s really about how the component moves when transitioning from one composite master to another. (This should be illustrated visually)

Designspace location

The designspace location for components is stored as an array of axis indices and a matching array of axis values.

The VarStore subtable is used to store variation deltas. It uses 16 bit integer values, but we use these for various flavors of 16 bit fixed values, too.

How to process VarC data?

When preparing a glyph outline for the rasterizer, the following logic needs to be applied:

Inputs:

glyph ID
designspace location

Output:

outline ready to be sent to the rasterizer

Steps:

If the glyph is a composite and has an entry in the VarC table:
- for each component:
  - Using the input designspace location, interpolate the transformation fields and the component’s designspace location
  - Retrieve the outline using this algorithm recursively, but using the component’s designspace location and glyph ID as inputs instead.
  - Transform the outline according to the transformation
Else:
- Proceed as usual, but apply the entire algorithm recursively, allowing for nested Variable Components

Or in pseudo code:

def getGlyphOutline(gid, location):
    if gid is a composite:
        for each component:
            if gid in VarC:
                compoTransform = instantiateTransform(location)
                compoLocation = instantiateLocalLocation(location)
            else:
                offset = instantiateOffset(location)
                compoTransform = getComponentTransform(component, offset)
                compoLocation = location  # global
            outline = getGlyphOutline(compoGID, compoLocation)
            outline = transformOutline(outline, compoTransform)
    else:
        outline = instantiateGlyfGvarGlyph(gid, location)
    return outline

To clarify: Variable Components completely determine the designspace location for the base glyph. Any axis not specified by a Variable Component has to be interpreted as set to its default, regardless of the global designspace location. In other words, Variable Components do not implicitly pass the global designspace location down to the base glyphs. (It can’t pass down local designspace coordinates, as local designspace may reuse axis IDs for different purposes. Axis X may do something completely different for glyph A than for glyph B.

This may be opened for discussion: it can be useful to pass down the global designspace coordinates down to base glyphs (unless overridden), but then we need to distinguish between global fvar axes and local (anonymous) fvar axes, due to the reusable nature of local axes in this design. To allow this, we need a new fvar axis flag in addition to the “hidden” flag. Please discuss here: #1

Local designspace coordinates need to be clamped, but it’s not clear yet how: #3

Format details

type	name	value
Version	Version	0x00010000
uint16	numGlyphs
LOffset	GlyphData[numGlyphs]
LOffset	VarStore

GlyphData: this is an array of offsets to glyph data subtables. It is indexed by glyph ID. If an offset is zero, then there is no Glyph data for this glyph. The numGlyphs field less than or equal to the total number of glyphs in the font.

VarStore: existing data structure to store all variation data, as used by GDEF, HVAR, VVAR, MVAR, etc.

Glyph: the data for a single glyph contains the component data for all components. The number of components is derived from the glyf table.

Component:

type	name	notes
uint16	flags	see below
uint8 or uint16	numAxes	This is a uint16 if bit 3 of `flags` is set, else a uint8
uint8 or uint16	axisIndices[numAxes]	This is a uint16 if bit 3 of `flags` is set, else a uint8 The most significant bit of each axisIndex tells whether this axis has a VarIdx in the VarIdxs array below. Bits 0..6 (uint8) or 0..14 (uint16) form the axis index.
Coord16	axisValues[numAxes]	The axis value for each axis
Angle16	Rotation	Optional, only present if it 5 of `flags` is set
Scale16	ScaleX	Optional, only present if it 6 of `flags` is set
Scale16	ScaleY	Optional, only present if it 7 of `flags` is set
Angle16	SkewX	Optional, only present if it 8 of `flags` is set
Angle16	SkewY	Optional, only present if it 9 of `flags` is set
Int16	TCenterX	Optional, only present if it 10 of `flags` is set
int16	TCenterY	Optional, only present if it 11 of `flags` is set
uint8	entryFormat	See below
VarIdx	VarIdxs[varIdxCount]	see below

Each VarIdx value represents an index into the VarStore, which contains all variation data.
varIdxCount is determined by the sum of:
- The number of axes that have a VarIdx
- The number of transformation fields, if bit 4 of flags is set
VarIdx entries are 1, 2, 3 or 4 bytes long. This is determined by the entryFormat field, see below.

Component flags:

bit number	meaning
0..2	Number of integer bits for ScaleX and ScaleY, mask: 0x07
3	axis indices are shorts (clear = bytes, set = shorts)
4	Transformation fields have VarIdx
5	have Rotation
6	have ScaleX
7	have ScaleY
8	have SkewX
9	have SkewY
10	have TCenterX
11	have TCenterY
12	If ScaleY is missing: take value from ScaleX (to be discussed here: #2)
13	(reserved, set to 0)
14	(reserved, set to 0)
15	(reserved, set to 0)

Notes on precision

We chose to store all relevant fields as 16-bit values for maximum compactness, and compatibility with the VarStore format. The downside of this is that we need to choose the range of the fields carefully, as the range of delta values may exceed the range of master values by a factor related to the number of axes involved.

In one case (component scale factors) we chose to use a three-bit field to specify the number of integer bits to be used for scale factors. This gives us a flexible range that can be made to fit the required range.

In other cases (Angle16 and Coord16) we simply chose a larger range than required for the master values, so there is some wiggle room for delta values that are outside of the master value range.

Delta values are stored in the VarStore subtable, and they use the same formats as their corresponding master values.

We observe that https://github.com/googlefonts/colr-gradients-spec/ adds 32-bit value support to the VarStore format. VarC table could benefit from that as well, at the expense of compactness.

Field formats

Angle16: this is an int16 value used to represent an angle. To scale an angle in degrees to this format, multiply the angle by 0x8000 / (4 * 360). This gives us an effective range of -1440 degrees to +1440 degrees. Master values are expected to be between -360 and +360 degrees. The extra headroom is to allow for delta values that are outside of the master range.

Scale16: this is an int16 used as a 16 bit Fixed number, where the number of integer bits is specified by bits 0..2 of the flags field. This allows us to use 16 bits precision for a flexible range of scale values, depending on what the component needs. It avoids having a small maximum (as with Fixed2Dot14, which goes from -2 to +2) while sticking to 16 bits precision. The number of precision bits is 16 - number-of-integer-bits.

Coord16: this is in int16 value used to represent a coordinate in a designspace location. This is defined as a Fixed4Dot12. Master values are expected to be between -1.0 and +1.0, but delta values may be outside that range.

VarIdx array: this is a compactly stored array with VarIdx values, which reference items in the VarStore. A VarIdx value is normally 32 bit, using 16 bits for the outer index and 16 bits for the inner index. The array items are 1, 2, 3 or 4 bytes long, and are formatted as specified by the entryFormat field. This is identical to the entryFormat field of the DeltaSetIndexMap subtable from the HVAR table.

New axis flag for the fvar table

The axes used to implement local designspaces for components should never be exposed to users, and should be marked as such with a new fvar axis flag:

Mask	Name	Description
0x0002	INTERNAL_AXIS	The axis is only used internally, and should not be exposed in user interfaces. Used to implement local designspaces for Variable Components.

This is a backwards-compatible change, and therefore the fvar table version does not need to be updated.

Notes on non-linear interpolation

Because the (local) axes for VariableComponents are controlled by global (user) axes, this proposal contains the possibility to do non-linear interpolation, without the need for duplicate fvar axis tags (*). However, it is currently limited to variable components. It is possible to change the proposal in a small way that could apply the same control over designspace location and transformation on the outlines of the glyph, if it is not a composite.

*) By giving multiple fvar axes the same axis tags, many implementations allow multiple axes to be controlled from a single value.

Comparison to the COLRv1 proposal

There is a proposal which adds a much enhanced version of the COLR table: https://github.com/googlefonts/colr-gradients-spec/

There happens to be some overlap between the Variable Components proposal and the COLRv1 proposal.

A COLR glyph can be seen as a composite glyph, but with paint properties associated with each component. COLRv1 enhances this idea by adding transformations to the “components”, in a way that is conceptually very similar to the Variable Components proposal, but is different in every detail.

From a bird’s eye perspective, both proposals implement “doing components better”, but for different use cases.

We currently see two ways of addressing this:

Attempt to bring the data structures for component transformations together, so they can be shared in the spec.
Merge the two proposals completely, by adding “local designspaces” to COLRv1, and promote the idea that COLRv1 is about “better components” first, and about color second.

COLRv1 effectively adds a new glyph type, one that overrides glyf and CFF/CFF2 (but uses those as outline sources), whereas the Variable Components proposal builds on the composite/component structure of the glyf table (and is not compatible with CFF/CFF2).

More research and discussion is needed:

Prototype Implementation

There is some prototype-level code that implements readers and writers for the VarC table as part of this repository: https://github.com/BlackFoundryCom/rcjk-tools

It relates to Black[Foundry]’s Robo-CJK RoboFont extension, which heavily uses variable components: https://github.com/BlackFoundryCom/robo-cjk

Prior Art

In March 2020 we published the RoboCJK Deep Components Proposal.

In September 2018, Microsoft published an XVAR Proposal.

variable-components-spec's People

Contributors

Stargazers

Watchers

variable-components-spec's Issues

Clarify: The global designspace location can affect the composite glyph...

The global designspace location can affect the composite glyph, but it does not need to affect the designspace location of the base glyph directly.

Is this saying that the global designspace (GDS) location can affect the composite designspace location of components?

The OT 'glyf' chapter doesn't use the term "base glyph". Do you mean the first component in the composite description? (I'm guessing No.) Or is this saying that affects from the GDS location on a component do not affect the underlying glyph outline of the component (i.e., doesn't entail any effects on other uses of that glyph outline as a component)? Or, is this saying that the GDS location can affect the underlying outline of a component and that those effects will be seen in the composite?

In saying "doesn't need", it's not clear to me if you're saying, "In some cases, X might not happen" or rather "To achieve the design goal of the proposal, it is not a requirement that..."

Clarify: The base glyph ID. This specifies which glyph we are referencing.

Here and elsewhere you use "base glyph", which is not a term used in the OT 'glyf' chapter when discussing composites. It would help if you clarified. In this particular instance, it's not clear if what is meant is the glyph ID of the composite being assembled or of a component used in the composite.

Conflict with COLRv1 transforms… merge the proposals?

Nice to see this proposal.

I wanted to check you are aware of another OpenType format proposal relating to components, namely my own colr-gradients-spec/Rotated components suggestion to add component transforms to the COLRv1 format. The idea behind it is that COLRv0 layers are already rather like components, and that, by allowing transforms in each included glyph layer, we can make COLRv1 a superset of the capabilities of old-style "glyf" composites. While making the colour Muybridge horse, I realized that there was a common case where the fallback b/w glyph references exactly the same glyph components as the COLR glyph layers, and so it seemed sensible to give COLR the same capabilities for transformation as glyf has, rather than force this case to perform all transforms in glyf to new transformed components, then layer those transformed composite glyphs in COLR. For good measure, I proposed flags to handle explicit rotation in degrees and @jfkthame added shear in degrees too. I imagined in the future controlling this rotation angle with an axis. Also, one day, I imagined this improved transform structure being incorporated into an updated glyf spec, to avoid people using the COLR apparatus unnecessarily — @behdad believes clients should be free to ignore COLR/CPAL tables entirely if they have no use for colour.

So that’s the background to COLRv1 transforms. The proposal seems to have met with approval from @PeterConstable and @rsheeter, among others, and is well on the way to being proposed as an update to OpenType.

You can probably already see the issue I am worrying about, namely that, with your variable component idea and good old glyf, it makes THREE different ways to represent composite glyphs. I don’t think this is a good outcome. So I wonder what your thoughts are. I guess we can think of:

merging your component variations with the COLRv1 transform to use the same structure in both
removing transforms from COLRv1 for now, while we come up with a unified transform structure

In any case I would be pleased if you would take a look through the COLRv1 transform discussion, and add some comments. It would be a shame to do things differently without a clear reason.

How to specify uniform x, y scaling

In component transformations, we define ScaleX and ScaleY. Ideally only one should value be required when x and y scale are the same.

There are three options (the default scale factor is 1.0):

Each value needs to be specified if it's different from the default. So when x == y and x != default, both need to be specified (this is the current situation, as implented by rcjktools).
If ScaleY is missing, it should default to whatever ScaleX is. Downside: need two values if x != default and y == default
If ScaleY is missing, it should default to whatever ScaleX is if a certain flag has been set, else default to 1.0.

Option 3 is most compact, but perhaps it is trying too hard to solve a relatively rare case. It may not be worth the added complexity.

I'm leaning towards option 2.

Clarify: The composite is in full control of the designspace location of the component

I gather "the designspace" in this case is that of the composite, not the global designspace?

VarIdx's should be controlled by a flag

Not every font using smart components will be variable... I think a flag should control whether VarIdx values are present.

This also means the font should work without fvar.

Can an internal axis have a normalized range between -1.0 and +1.0, or are we tied to 0.0 and 1.0?

How does our design of reusable axes affect the normalized range of an axis, and does it matter?

Normalized axis values are between -1.0 and +1.0, but can be limited based on the fvar user values:

if min < default < max, then the range is -1.0 to +1.0
if min == default and default < max, then the range is 0.0 to +1.0
if min < default and default == max, then the range is -1.0 to 0.0

At the VarC level, we don't deal with user coordinates anymore, but we may need to specify how to clamp interpolated designspace coordinates.

Should the global designspace location be implicitly passed down to components?

The current design and implementation (at https://github.com/BlackFoundryCom/rcjk-tools/) do not do this.

If a component base glyph implements variations for a global axis (say wght), the current design requires to set the value wght explicitly from the component reference.

If the wght value does not need to be changed by the component reference, this will require some redundant data.

It appears that the model GlyphsApp uses for Smart Components would benefit from implicitly passing down global axis values. I don't think Smart Component references can influence/overwrite global axis values.

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.
Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
TensorFlow

An Open Source Machine Learning Framework for Everyone
Django

The Web framework for perfectionists with deadlines.
Laravel

A PHP framework for web artisans
D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
web

Some thing interesting about web. New door for the world.
server

A server is a program made to process requests and deliver data to clients.
Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Visualization

Some thing interesting about visualization, use data art
Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.
Microsoft

Open source projects and samples from Microsoft.
Google

Google ❤️ Open Source for everyone.
Alibaba

Alibaba Open Source for everyone
D3

Data-Driven Documents codes.
Tencent

China tencent open source team.