Tabiya Documentation
HomepageGithub
🇬🇧 English
  • Tabiya Documentation
🇬🇧 English
  • Welcome
  • Overview
    • About Tabiya
    • The Global Youth Employment Challenge
      • The Role of Labor Market Intermediation
      • Digital Platforms and AI in LMIC Labor Market Intermediation
  • Open-Source Tech for Labor Markets
  • Our Tech Stack
    • Inclusive Livelihoods Taxonomy
      • Methodology
      • Why ESCO?
      • Core Taxonomy
      • Open Taxonomy Platform
      • Taxonomy CSV Format
    • Livelihoods Classifier
      • Getting Started
      • Web Application
      • Datasets
      • Training
      • Advanced Topics
      • Contributing Guide
      • FAQs
      • Demo Video
    • Compass
      • Technical Overview
      • UX Evaluation
        • UX Testing Discussion Guide
      • Roadmap
Powered by GitBook
On this page
  • General notes on the fields of the CSV files
  • UUID History
  • Origin Uri
  • ID
  • Object Types
  • Lists
  • Dates
  • File descriptions
  • Model Info
  • Skills
  • Skill Groups
  • Occupations
  • Occupation Groups
  • Columns
  • Skill-to-Skill Relations
  • Occupation-to-Skill Relations
  • Skill Hierarchy
  • Occupation Hierarchy
  • LICENSE
Export as PDF
  1. Our Tech Stack
  2. Inclusive Livelihoods Taxonomy

Taxonomy CSV Format

The Tabiya CSV format is used to import and export data from the Tabiya Open Taxonomy platform.

PreviousOpen Taxonomy PlatformNextLivelihoods Classifier

Last updated 1 month ago

Each taxonomy version is made up of nine CSV files Each file contains a different type of data. The files are:

General notes on the fields of the CSV files

UUID History

A UUIDHISTORY field is a of all the UUIDs that have been assigned to an entity during its lifecycle, e.g. when the entity is created, imported, exported or copied into our platform.

It is an identifier that can be used for tracking objects not only across their lifecycle, but also across systems.

The UUID history is ordered from newest to oldest UUID.

The first entry in the list is the current UUID of the object. The last entry in the list is the very first (initial) UUID of the object.

The entities in this dataset have been assigned an initial UUID. When an entity is imported into our platform, a new UUID will be issued and added at the top of UUID history.

The maximum number of UUIDs in the history for an object is constrained to 10000.

Origin Uri

The maximum length for the Origin Uri is 4096 characters.

ID

The ID field is a unique identifier for each entity in the CSV dataset. It is used for referencing within the CSV dataset, for example, in the relations between entities.

Object Types

The object types are used to differentiate between different types of entities in the dataset.

For example in relations between entities, the object types are used to specify the type of the parent and child objects and determine in which file these objects can be located.

The object types in the CSV files are:

Lists

List properties are stored in the CSV files as strings separated by a character. Currently, we do not support values that contain a new line.

Dates

File descriptions

Model Info

Contains information about the model. The export filename is model_info.csv

Columns

  • NAME: The name of the model.

  • LOCALE: The short code of the model's locale.

  • DESCRIPTION: The description of the model.

  • VERSION: The version of the model.

  • RELEASED: A boolean value that indicates whether the model is released or not.

  • RELEASENOTES: The release notes of the model.

Skills

Contains the skills of the taxonomy. The export filename is skills.csv

Columns

  • SKILLTYPE: The skill type.

    • Possible values: skill/competence,knowledge,language,attitude or empty ( ).

  • REUSELEVEL: The skill reuse level.

    • Possible values: sector-specific,occupation-specific,cross-sector,transversal or empty ( ).

  • PREFERREDLABEL: The preferred label of the skill.

    • Maximum length per label: 256 characters.

    • Maximum number of labels: 100.

  • DESCRIPTION: The skill description.

    • Maximum length:4000 characters.

  • DEFINITION: The skill definition.

    • Maximum length:4000 characters.

  • SCOPENOTE: The skill scope note.

    • Maximum length:4000 characters.

  • ISLOCALIZED: A boolean value that indicates whether the skill is localized or not.

    • Possible values: true or false.

Skill Groups

Contains the skill groups of the taxonomy. The export filename is skill_groups.csv

Columns

  • CODE: SkillGroup code as defined in ESCO. It has the general format SX.X.X, where X is a number.

  • PREFERREDLABEL: The preferred label of the skill group.

    • Maximum length per label: 256 characters.

    • Maximum number of labels: 100.

  • DESCRIPTION: The skill group description.

    • Maximum length:4000 characters.

  • SCOPENOTE: The skill group scope note.

    • Maximum length:4000 characters.

Occupations

Contains the occupations of the taxonomy. The export filename is occupations.csv

Columns

  • OCCUPATIONGROUPCODE:The Occupation group that the occupation belongs to.

  • CODE: An occupation code assigned to the occupation.

    • For ESCO occupations, the code will be the parent code, followed by a . and any number of digits. Eg: XXXX.1234

    • For local occupations, the code will be the parent code, followed by an _ and any number of digits. XXXX_1234

  • PREFERREDLABEL: The preferred label of the occupation.

    • Maximum length per label: 256 characters.

    • Maximum number of labels: 100.

  • DESCRIPTION: The occupation description.

    • Maximum length:4000 characters.

  • DEFINITION: The occupation definition.

    • Maximum length:4000 characters.

  • SCOPENOTE: The occupation scope note.

    • Maximum length:4000 characters.

  • REGULATEDPROFESSIONNOTE: The regulated profession note.

    • Maximum length:4000 characters.

  • OCCUPATIONTYPE: The type of the occupation.

    • Possible values: escooccupation or localoccupation.

  • ISLOCALIZED: A boolean value that indicates whether the occupation is localized or not. Only ocuppations of the type escooccupation can be localized.

    • Possible values: true or false.

Occupation Groups

Contains the Occupation groups of the taxonomy. The export filename is occupation_groups.csv

Columns

  • CODE: A four digit identification code of the Occupation group. Each digit represents a level in the hierarchy.

    • For ISCO groups, the code is a maximum of 4 digits, and each child group should have a code that begins with the parent group code. Eg: 1234

    • For local groups without a parent group, the code should start with an alphabetical character. Eg: A1234

    • For local groups, if the parent occupation group is an isco group, the code should start with the parent group code and then have one alphabetical character. Eg: 1234A

    • For local groups, if the parent occupation group is also a local group, the code should start with the parent group code and then have either an alphabetical character or a number. Eg: 1234AB or 1234A1

  • GROUPTYPE: The type of the Occupation group.

    • Possible values: iscogroup or localgroup.

  • PREFERREDLABEL: The preferred label of the Occupation group.

    • Maximum length per label: 256 characters.

    • Maximum number of labels: 100.

  • DESCRIPTION: The Occupation group description.

    • Maximum length:4000 characters.

Skill-to-Skill Relations

Contains the relations between skills. The export filename is skill_to_skill_relations.csv

Columns

  • RELATIONTYPE: The type of the relation.

    • Possible values: essential or optional.

Occupation-to-Skill Relations

Contains the relations between occupations and skills. The export filename is occupation_to_skill_relations.csv

Columns

  • OCCUPATIONTYPE: The type of the occupation.

    • Possible values: escooccupation or localoccupation.

  • RELATIONTYPE: The type of the relation.

    • Possible values: essential, optional, or it can be left empty.

  • SIGNALLINGVALUELABEL: The signalling value label of the relation.

    • Possible values: low, medium, high, or it can be left empty.

  • SIGNALLINGVALUE: The signalling value of the relation.

    • A number between 0 and 1, or it can be left empty. The only allowed delimiter for decimal numbers is a ..

Caveat: An escooccuption cannot have a signalling value or signalling value label. It must have a relationType. For localoccupations signalling value and relationType are mutually exclusive. A localoccupation can either have a signalling value and signalling value label or it can have a relationType, but not both.

Skill Hierarchy

Contains the hierarchical structure of various skills. The export filename is skill_hierarchy.csv

Columns

  • PARENTOBJECTTYPE: The type of the parent object.

    • Possible values: skill or skillgroup.

  • CHILDOBJECTTYPE: The type of the child object.

    • Possible values: skill or skillgroup.

Caveat: A skill cannot be the parent of a skill group.

Occupation Hierarchy

Contains the hierarchical structure of various occupations. The export filename is occupation_hierarchy.csv

Columns

  • PARENTOBJECTTYPE: The type of the parent object.

    • Possible values: occupationgroup, escooccupation, localoccupation.

  • CHILDOBJECTTYPE: The type of the child object.

    • Possible values: occupationgroup, escooccupation, localoccupation.

Caveat: An escooccupation cannot be the parent of an 'occupationgroup'. Caveat: An localoccupation can be a child of an escooccupation or another localoccupation.

LICENSE

Contains the license information for the model. If one wants to add a license to the dataset, it can be added to a file named LICENSE in the root of the dataset. The LICENSE file supports plain text and Markdown format. During export the license information of the model will also be exported in the LICENSE file.

The UUID used by the platform are based on the standard.

The ORIGINURI field is a that points to the location where an entity was originally defined.

This field is not meant to be used as an identifier outside the scope of the CSV files, for that purpose you should use the first entry in the .

skill: Represents a .

skillgroup: Represents a .

escooccupation: Represents an that originates from the ESCO framework.

localoccupation: Represents an that not originate from the ESCO framework and is defined only this taxonomy.

occupationgroup: Represents an .

The dates in the CSV files are stored in the format.

: A list of .

CREATEDAT: The the model was created.

UPDATEDAT: The the model was last updated.

: A that points to the location where the skill was originally defined.

: A , used for referencing the skill within the CSV dataset.

: A list of .

ALTLABELS: A of alternative labels for the skill.

CREATEDAT: The the skill was created.

UPDATEDAT: The the skill was last updated.\

: A that points to the location where the skill group was originally defined.

: A , used for referencing the skill group within the CSV dataset.

: A list of .

ALTLABELS: A of alternative labels for the skill group.

CREATEDAT: The the skill group was created.

UPDATEDAT: The the skill group was last updated.

: A that points to the location where the occupation was originally defined.

: A , used for referencing the occupation within the CSV dataset.

: A list of .

ALTLABELS: A of alternative labels for the occupation.

CREATEDAT: The the occupation was created.

UPDATEDAT: The the occupation was last updated.

: A that points to the location where the Occupation group was originally defined.

: A , used for referencing the Occupation group within the CSV dataset.

: A list of .

ALTLABELS: A of alternative labels for the Occupation group.

CREATEDAT: The the Occupation group was created.

UPDATEDAT: The the Occupation group was last updated.

REQUIRINGID: The of the skill that requires another skill.

REQUIREDID: The of the skill that is required by another skill.

CREATEDAT: The the relation was created.

UPDATEDAT: The the relation was last updated.

OCCUPATIONID: The of the occupation.

SKILLID: The of the skill.

CREATEDAT: The the relation was created.

UPDATEDAT: The the relation was last updated.

PARENTID: The of the parent object.

CHILDID: The of the child object.

CREATEDAT: The the relation was created.

UPDATEDAT: The the relation was last updated.

PARENTID: The of the parent object.

CHILDID: The of the child object.

CREATEDAT: The the relation was created.

UPDATEDAT: The the relation was last updated.

Universally Unique Identifier v4
URI
ISO 8601
Model Info
Skill Groups
Skills
Skill Hierarchy
Skill to Skill Relations
Occupation Groups
Occupations
Occupation Hierarchy
Occupation to Skill Relations
LICENSE
list
UUID History
skill
skill group
occupation
occupation
Occupation group
UUIDHISTORY
UUIDs
date
date
ORIGINURI
URI
ID
unique identifier
UUIDHISTORY
UUIDs
list
date
date
ORIGINURI
URI
ID
unique identifier
UUIDHISTORY
UUIDs
list
date
date
ORIGINURI
URI
ID
unique identifier
UUIDHISTORY
UUIDs
list
date
date
ORIGINURI
URI
ID
unique identifier
UUIDHISTORY
UUIDs
list
date
date
ID
ID
date
date
ID
ID
date
date
ID
ID
date
date
ID
ID
date
date