Marking Data

Data Marking Icon

In STIX, data markings are used to mark specific pieces of the STIX document with some sort of information. In many cases that information is handling instructions, classifications, or terms of use but in reality the data markings structure can be used to mark the data with anything. For example, data markings could be used to indicate that the STIX document is part of an exercise and is not actual production data.

Markings in STIX are are abstract in specification but applied at the field level, meaning that marking statements can be made in locations other than directly on the fields being marked but can apply as atomically as individual fields (elements and attributes in the XML) up to all fields in the document (essentially marking the whole document). Thus a copyright may be applied across the entire document while specific terms of use might apply to certain fields in indicator test mechanisms or courses of action (as an example).

Using Markings

Before talking about how markings are defined and represented it’s useful to understand how and where markings are used. The most common place to see data markings is in the Handling field of the STIX Header. Markings placed in this field are often used to apply markings globally either to the entire package or to specific types of information regardless of where it appears in the Package.

For example, a copyright that applies to the entire STIX package would be best placed in the handling field of the header. Similarly, the indication that all indicator courses of action are TLP:RED would also be best placed in the header.

Here’s a simplified look at a marking placed in the STIX header. The actual marking content is omitted (see below for a discussion on how to create marking statements) in order to focus on where the marking appears.

1
2
3
4
5
6
7
<stix:STIX_Package>
  <stix:STIX_Header>
    <stix:Handling>
      <marking:Marking><!-- Marking content here --></marking:Marking>
    </stix:Handling>
  </stix:STIX_Header>
</stix:STIX_Package>
1
2
3
4
5
6
7
8
9
10
11
from stix.core import STIXPackage, STIXHeader
from stix.data_marking import Marking, MarkingSpecification

handling = Marking()
handling.add_marking(MarkingSpecification())

stix_package = STIXPackage()
stix_package.stix_header = STIXHeader()
stix_package.stix_header.handling = handling

print stix_package.to_xml()

The document header is not the only place data markings can be used, however. Individual STIX components (Indicators, Courses of Action, etc.) all each have their own Handling field. As you might expect, the Handling field within a component restricts the marking applicability to just fields within that component. That allows consumers to safely preserve markings within a component and move it between documents or into a datastore without worrying that the markings will change in meaning. Note, however, that this is not enforced in the data model: it’s up to consumers to enforce this and ensure that markings applied within components do not “break out” of those components.

1
2
3
4
5
6
7
8
9
<stix:STIX_Package>
  <stix:Indicators>
    <stix:Indicator>
      <indicator:Handling>
        <marking:Marking><!-- Marking content here --></marking:Marking>
      </indicator:Handling>
    </stix:Indicator>
  </stix:Indicators>
</stix:STIX_Package>
1
2
3
4
5
6
7
8
9
10
11
12
from stix.core import STIXPackage
from stix.indicator import Indicator
from stix.data_marking import Marking, MarkingSpecification

indicator = Indicator()
indicator.handling = Marking()
indicator.handling.add_marking(MarkingSpecification())

stix_package = STIXPackage()
stix_package.add_indicator(indicator)

print stix_package.to_xml()

Notice that, although the Handling field is placed directly in an Indicator rather than a STIX_Header, the field name is still <marking:Marking>. This is because markings use a common structure regardless of where they are used. That said, it’s important to understand where the marking structure is placed because the controlled structure may use relative document locations in order to mark specific fields (especially when used within components).

Aside: Marking Precedence

The STIX handling structures do allow documents to be marked with multiple markings, both of the same type and of different types. For example, an indicator could be marked both "Copyright Acme, Inc." and TLP:GREEN as well as both "TLP:GREEN" and "TLP:RED".

The behavior when an item is marked twice is undefined when multiple markings are applied (it's up to the consumers and producers to agree on what it means, though in many cases that will be obvious). Marking extensions themselves (TLP, for example) should define what to do when an item has multiple markings of that type applied. TLP says that the higher marking applies (so an item marked GREEN and RED would be considered marked RED) but other extensions might make it invalid to mark a construct twice or might define different rules.

Deciding where to place markings

For obvious reasons (given the scope for field-level markings) the only place to put markings that apply to a complete STIX Package/document are in the Handling field of the STIX header. Generally, copyright statements and terms of use will apply at this level, though there will be exceptions.

Similarly, when applying markings to specific types of information wherever they occur (all indicator titles, all threat actor identities, etc.) it’s usually best to place these markings in the header. That way they can consistently apply to everything in the package no matter which specific constructs they appear in.

On the other hand, it’s usually better to apply markings to specific constructs in the handling structure for the component itself. For example, if only one indicator out of 10 is TLP:RED, it would be best to apply that marking in the indicator’s handling field so it travels with the indicator.

One common use case, at least in TLP, is that most of the document is marked at a lower level (TLP:GREEN) while a few constructs are higher (TLP:AMBER). One strategy for approaching this is to use the global handling field to mark the entire document GREEN and then use more specific markings to override that and mark those constructs higher.

Defining Markings

STIX data markings are implemented via the MarkingSpecificationType, which has two primary fields: the Controlled_Structure field indicates which part of the STIX document is being marked, while the Marking_Structure field contains the marking itself. Besides these two primary fields are an id and idref pair to enable re-using markings, a version to indicate which version the data markings schema is being used, and an Information_Source to indicate who is marking the data and when it was marked.

Let’s dive into each piece of STIX data markings in turn.

Controlled Structure

As explained above, the Controlled_Structure field indicates which part of the document the markings apply to. Currently, this is implemented using XPath 1.0, a language for selecting portions of XML documents. As you would imagine, this means that the data markings structure is currently tied to XML, though notionally this controlled structure field simply indicates which part of the STIX document is being marked and so could be implemented in other document selection languages.

XPath treats elements (which it refers to as nodes) separately from the attributes on those elements. To select both XML elements and attributes, use both an XPath selector for nodes (such as node(), //node(), or descendant-or-self::node()) as well as an XPath selector for attributes (such as @*, //@*, or descendant-or-self::node()/@*). Join these with a | to indicate the union of the node selector and the attribute selector. Example selectors for common STIX scenarios are shown below.

As an example, here’s a small snippet that shows the controlled structure field in a handling construct:

1
2
3
4
5
6
<stix:Handling>
  <marking:Marking>
    <marking:Controlled_Structure>//node() | //@*</marking:Controlled_Structure>
    <!-- Marking statement would go here, indicating the marking statement applies to all nodes selected by the Controlled_Structure -->
  </marking:Marking>
</stix:Handling>
1
2
3
4
5
6
7
8
9
10
11
12
13
14
from stix.core import STIXPackage, STIXHeader
from stix.data_marking import Marking, MarkingSpecification

marking_specification = MarkingSpecification()
marking_specification.controlled_structure = "//node() | //@*"

handling = Marking()
handling.add_marking(marking_specification)

stix_package = STIXPackage()
stix_package.stix_header = STIXHeader()
stix_package.stix_header.handling = handling

print stix_package.to_xml()

Relative Paths

Because XPath allows relative paths, the location of the Controlled_Structure field in the document may be important. All relative paths should be applied from the location of the Controlled_Structure field. That means that an XPath ‘..’ would select the <Marking> parent of the Controlled Structure.

Namespace Prefixes

XPath selectors can make use of namespace prefixes. For example, “stix:Indicator” uses the stix prefix which, in most STIX documents, is defined to the STIX Core namespace. For the purpose of data marking XPaths, prefixes defined at the point of the Controlled_Structure are considered valid for the XPath statement. In most cases, this means that the prefixes you define at the top of the document (in STIX_Package) can be used in the controlled structure. To make your documents easier to use it’s probably best to stick with the common namespace prefixes used in the STIX schemas and examples.

Examples

A couple examples of some controlled structure statements are below:

Location of Marking Controlled Structure Statement Meaning
Header //node() | //@* All nodes in the entire document
Individual Component ../../../descendant-or-self::node() | ../../../descendant-or-self::node()/@* All nodes in the entire enclosing component
Header //stix:Indicator/descendant-or-self::node() | //stix:Indicator::descendant-or-self::node()/@* All instances of indicators in the document
Individual Component ../../../ta:Title/descendant-or-self::node() | ../../../ta:Title/descendant-or-self::node()/@* The Title field of the enclosing Threat Actor component
Note: This table was updated on 2015-02-18 to (correctly) select XML attributes in adddition to elements, to match the intended meanings.

Marking Structures and Default Extensions

Marking structures, as mentioned above, are an extension point in STIX. This means that any marking structure in use by the community can be used within STIX documents by simply defining the appropriate structure as an extension of MarkingStructureType. Note, however, that producers and consumers should agree on which types of extensions are used (perhaps through profiles) in order to make sure that all parties understand and will respect the particular marking structures used.

STIX itself defines three marking structure extensions, however others in the community may define additional structures. For example, the Information Sharing Architecture effort within the Enhanced Shared Situational Awareness (ESSA) community has defined an extension to mark STIX documents with their extensions to the U.S. Government’s Enterprise Data Headers.

Simple Marking

The simple marking extension allows users to make a text statement to mark the content. For example, copyright information can easily be communicated via a simple text statement “Copyright 2014, Acme Inc.”. As an example:

1
2
3
4
5
6
7
8
<stix:Handling>
  <marking:Marking>
    <marking:Controlled_Structure>//node() | //@*</marking:Controlled_Structure>
    <marking:Marking_Structure xsi:type="simpleMarking:SimpleMarkingStructureType">
      <simpleMarking:Statement>Copyright 2014, Acme Inc.</simpleMarking:Statement>
    </marking:Marking_Structure>
  </marking:Marking>
</stix:Handling>
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
from stix.core import STIXPackage, STIXHeader
from stix.data_marking import Marking, MarkingSpecification
from stix.extensions.marking.simple_marking import SimpleMarkingStructure

marking_specification = MarkingSpecification()
marking_specification.controlled_structure = "//node() | //@*"

simple = SimpleMarkingStructure()
simple.statement = "Copyright 2014, Acme Inc."
marking_specification.marking_structures.append(simple)

handling = Marking()
handling.add_marking(marking_specification)

stix_package = STIXPackage()
stix_package.stix_header = STIXHeader()
stix_package.stix_header.handling = handling

print stix_package.to_xml()

Terms of Use

The terms of use marking extension allows users to make a text statement to mark the terms of use of the marked content. For example, terms of use can easily be communicated via a simple text statement “Acme Inc. is not responsible for the content of this file”. In that sense it is very similar to the SimpleMarking extension but has a stronger semantic meaning.

As an example:

1
2
3
4
5
6
7
8
<stix:Handling>
  <marking:Marking>
    <marking:Controlled_Structure>//node() | //@*</marking:Controlled_Structure>
    <marking:Marking_Structure xsi:type="terms:TermsOfUseMarkingStructureType">
      <terms:Terms_Of_Use>Acme Inc. is not responsible for the content of this file</terms:Terms_Of_Use>
    </marking:Marking_Structure>
  </marking:Marking>
</stix:Handling>
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
from stix.core import STIXPackage, STIXHeader
from stix.data_marking import Marking, MarkingSpecification
from stix.extensions.marking.terms_of_use_marking import TermsOfUseMarkingStructure

marking_specification = MarkingSpecification()
marking_specification.controlled_structure = "//node() | //@*"

tou = TermsOfUseMarkingStructure()
tou.terms_of_use = "Acme Inc. is not responsible for the content of this file."
marking_specification.marking_structures.append(tou)

handling = Marking()
handling.add_marking(marking_specification)

stix_package = STIXPackage()
stix_package.stix_header = STIXHeader()
stix_package.stix_header.handling = handling

print stix_package.to_xml()

TLP

Traffic Light Protocol is a mechanism created by US-CERT and used throughout the cyber threat sharing industry to indicate how content may be shared. STIX provides a TLP extension that allows content to be marked with the colors from the TLP matrix.

As an example:

1
2
3
4
5
6
<stix:Handling>
  <marking:Marking>
    <marking:Controlled_Structure>//node() | //@*</marking:Controlled_Structure>
    <marking:Marking_Structure xsi:type="tlp:TLPMarkingStructureType" tlp:color="AMBER" />
  </marking:Marking>
</stix:Handling>
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
from stix.core import STIXPackage, STIXHeader
from stix.data_marking import Marking, MarkingSpecification
from stix.extensions.marking.tlp import TLPMarkingStructure

marking_specification = MarkingSpecification()
marking_specification.controlled_structure = "//node() | //@*"

tlp = TLPMarkingStructure()
tlp.color = "AMBER"
marking_specification.marking_structures.append(tlp)

handling = Marking()
handling.add_marking(marking_specification)

stix_package = STIXPackage()
stix_package.stix_header = STIXHeader()
stix_package.stix_header.handling = handling

print stix_package.to_xml()

Further Reading

  • xsi:type - The STIX xsi:type-based extension mechanism is used to implement marking structures.
  • MarkingType