Skip to main content

Specification

Last Updated: 2025-01-24

note

This information can be useful for building a dataset for language models.

The IT section uses the following tags and data format:

  • Glossary - an alphabetical list of words relating to a specific subject.
    It is permissible to use multi-line definitions.

    * **Term 1** - Definition
    * **Term 2** - Definition Line 1<Space><Space>
    Definition line 2
    * **Term 3** - Definition
  • FAQ - question-answer data format:

    ## Question

    Answer
  • Links - list of links:

    * [Text](https://example.org/) - description.

    May have categories:

    ## Category

    * [Text](https://example.org/) - description.