Skip to content

Commit

Permalink
Browse files Browse the repository at this point in the history
  • Loading branch information
Trondtr committed Sep 24, 2024
2 parents 964cc78 + 42a1cf5 commit c9e6738
Show file tree
Hide file tree
Showing 677 changed files with 20,713 additions and 32,920 deletions.
16 changes: 4 additions & 12 deletions AboutGiellaLT.md
Original file line number Diff line number Diff line change
Expand Up @@ -6,33 +6,25 @@ It is an open source website providing analysers and tools for
[a wide range of languages](LanguageModels.html), as well as
[a ready-made setup](infra/HowToAddANewLanguage.md) for adding more languages.




# The possibility to make computer tools for your language

## The possibility to make computer tools for your language

Computer tools supported by our infrastructure include:


- linguistic analysers (morphology, syntax)
- spell checkers and grammar checkers
- morphologically enabeled e-dictionaries
- machine translation


# ... by using the following linguistic technology
## ... by using the following linguistic technology

We write our morphologies as [finite state transducers](https://en.wikipedia.org/wiki/Finite_state_transducer)
in the formalisms *lexc*, *twolc* and *xfst rewrite rules*, and compile them into computer programs for language analysis with the compilers [xfst](http://fsmbook.com),
in the formalisms _lexc_, _twolc_ and _xfst rewrite rules_, and compile them into computer programs for language analysis with the compilers [xfst](http://fsmbook.com),
[hfst](http://www.ling.helsinki.fi/kieliteknologia/tutkimus/hfst/) or [foma](https://github.com/mhulden/foma).
Our syntaxes we write in [constraint grammar](https://en.wikipedia.org/wiki/Constraint_grammar),
and we compile our constraint grammars with [vislcg3](http://beta.visl.sdu.dk/cg3.html).
The installation of these compilers is documented on the [Getting Started](infra/GettingStarted.html) page.


# Source code, licensing and cooperation

## Source code, licensing and cooperation

All our resources, infrastructure and linguistic content alike, are available under dual licenses, CC-by-SA and GPL. You may thus take whatever resource you find useful with you and go, as long as you refer to us when you use it.

Expand Down
38 changes: 19 additions & 19 deletions CorpusResources.md
Original file line number Diff line number Diff line change
@@ -1,77 +1,77 @@
# Corpus Resources

![Warning](images/Warning.svg)
__*Under construction.*__
**_Under construction._**

This page contains a dynamically built list of all corpus repositories. Private repositories are not listed.

# Overview
## Overview

{% assign lang_repos = site.github.public_repositories|jsonify %}

<div id="corp_languges" ></div>

# Grouped according to geography
## Grouped according to geography

## Languages of the Nordic countries
### Languages of the Nordic countries

<div id="geo_nordic" class="twocolumn" ></div>

## Languages of Russia
### Languages of Russia

<div id="geo_russia" class="twocolumn" ></div>

## Other European languages
### Other European languages

<div id="geo_europe" class="twocolumn" ></div>

## Languages in North America
### Languages in North America

<div id="geo_northamerica" class="twocolumn" ></div>

## Languages in Africa
### Languages in Africa

<div id="geo_africa" class="twocolumn" ></div>

## Languages in other parts of the world
### Languages in other parts of the world

<div id="geo_other" class="twocolumn" ></div>

## Languages with no geography tag
### Languages with no geography tag

<div id="geo_undef" class="twocolumn" ></div>

# Grouped according to language family
## Grouped according to language family

## Uralic Languages
### Uralic Languages

<div id="fam_uralic" class="twocolumn" ></div>

## Eskimo-Aleut Languages
### Eskimo-Aleut Languages

<div id="fam_eskimo_aleut" class="twocolumn" ></div>

## Algic Languages
### Algic Languages

<div id="fam_algic" class="twocolumn" ></div>

## Indoeuropean languages
### Indoeuropean languages

<div id="fam_indoeuropean" class="twocolumn" ></div>

## Niger-Congo Languages
### Niger-Congo Languages

<div id="fam_nigercongo" class="twocolumn" ></div>

## Turkic Languages
### Turkic Languages

<div id="fam_turkic" class="twocolumn" ></div>

## Languages of other language families, isolates, artificial languages
### Languages of other language families, isolates, artificial languages

<div id="fam_other" class="twocolumn" ></div>

## Languages with no language family tag
### Languages with no language family tag

<div id="fam_undef" class="twocolumn" ></div>

Expand Down
12 changes: 6 additions & 6 deletions DocumentationGuide.md
Original file line number Diff line number Diff line change
Expand Up @@ -3,12 +3,12 @@
The documentation is organised as follows:

- language specific documentation is organised in separate subdomains, links to each language can be found as follows:
- [keyboards and locales](keyboards/KeyboardLayouts.md)
- [morphology, syntax, text processing, proofing tools](LanguageModels.md)
- [speech technology resources](SpeechTechnologyResources.md)
- [keyboards and locales](keyboards/KeyboardLayouts.md)
- [morphology, syntax, text processing, proofing tools](LanguageModels.md)
- [speech technology resources](SpeechTechnologyResources.md)
- general technical & language independent documentation: [this site](/index.md)
- Documentation specific to Divvun, Giellatekno and Tromsø:
- [old site](https://giellalt.uit.no)
- [new site](https://divvungiellatekno.github.io/giellalt.uit.no/) (will be moved to the old site URL when it is fully converted)
- [old site](https://giellalt.uit.no)
- [new site](https://divvungiellatekno.github.io/giellalt.uit.no/) (will be moved to the old site URL when it is fully converted)

Documentation on how to *write* and *publish* documentation [can be found here](infra/docinfra.md).
Documentation on how to _write_ and _publish_ documentation [can be found here](infra/docinfra.md).
13 changes: 7 additions & 6 deletions Games.md
Original file line number Diff line number Diff line change
Expand Up @@ -4,15 +4,16 @@ The languages are grouped according to game.

{% assign games_repos = site.github.public_repositories|jsonify %}

# Word guessing game
## Word guessing game

Simple word guessing game in the tradition of [MasterMind](https://en.wikipedia.org/wiki/Mastermind_(board_game)). For more information on the source code, see [this repo](https://github.com/giellalt/template-wordguess-und).

<div id="wordguess" ></div>
Simple word guessing game in the tradition of [MasterMind](<https://en.wikipedia.org/wiki/Mastermind_(board_game)>). For more information on the source code, see [this repo](https://github.com/giellalt/template-wordguess-und).

```html
<div id="wordguess"></div>

<script src="/assets/js/langtable.js"></script>
<script>
const domWordGames = document.querySelector('#wordguess');
domWordGames.appendChild(addGameTable({{games_repos}}, 'wordguess-', ['game']))
const domWordGames = document.querySelector('#wordguess');
domWordGames.appendChild(addGameTable({{games_repos}}, 'wordguess-', ['game']))
</script>
```
46 changes: 23 additions & 23 deletions KeyboardLayouts.md
Original file line number Diff line number Diff line change
Expand Up @@ -2,93 +2,93 @@

Beware that the documentation pages for most Experimental repos have little or no content, and that documentation for other keyboards probably is out-of-date. Writing documentation is an ongoing effort, and part of the development process. Automatically generated SVG layouts is presently not working.

The languages are grouped in three different ways, according to *maturity, geography* and *language family*. [Private repositories](https://github.com/divvun/private-registry) are not listed.
The languages are grouped in three different ways, according to _maturity, geography_ and _language family_. [Private repositories](https://github.com/divvun/private-registry) are not listed.

# Grouped according to maturity of the keyboards
## Grouped according to maturity of the keyboards

Being in the **Production** group does not necessarily mean it is in production for both mobile and desktop, it can be only one of them. We don't differentiate between the two categories, as soon as a keyboard is released for the general audience for at least one platform, it is in the **Production** category. See the documentation for each keyboard for further details.

{% assign keyb_repos = site.github.public_repositories|jsonify %}

## Production keyboard layouts [![Maturity: Production](https://img.shields.io/badge/Maturity-Production-brightgreen.svg)](/MaturityClassification.html)
### Production keyboard layouts [![Maturity: Production](https://img.shields.io/badge/Maturity-Production-brightgreen.svg)](/MaturityClassification.html)

<div id="prod_keyboards" ></div>

## Beta keyboard layouts [![Maturity: Beta](https://img.shields.io/badge/Maturity-Beta-yellow.svg)](/MaturityClassification.html)
### Beta keyboard layouts [![Maturity: Beta](https://img.shields.io/badge/Maturity-Beta-yellow.svg)](/MaturityClassification.html)

<div id="beta_keyboards" ></div>

## Alpha keyboard layouts [![Maturity: Alpha](https://img.shields.io/badge/Maturity-Alpha-red.svg)](/MaturityClassification.html)
### Alpha keyboard layouts [![Maturity: Alpha](https://img.shields.io/badge/Maturity-Alpha-red.svg)](/MaturityClassification.html)

<div id="alpha_keyboards" ></div>

## Experimental keyboard layouts [![Maturity: Experiment](https://img.shields.io/badge/Maturity-Experiment-black.svg)](/MaturityClassification.html)
### Experimental keyboard layouts [![Maturity: Experiment](https://img.shields.io/badge/Maturity-Experiment-black.svg)](/MaturityClassification.html)

Initial experiments and student exercises.

<div id="exper_keyboards" ></div>

## Keyboard layouts of undefined maturity [![Maturity: Undefined](https://img.shields.io/badge/Maturity-Undefined-lightgrey.svg)](/MaturityClassification.html)
### Keyboard layouts of undefined maturity [![Maturity: Undefined](https://img.shields.io/badge/Maturity-Undefined-lightgrey.svg)](/MaturityClassification.html)

<div id="undef_keyboards" class="twocolumn" ></div>

# Grouped according to geography
## Grouped according to geography

## Languages of the Nordic countries
### Languages of the Nordic countries

<div id="geo_nordic" class="twocolumn" ></div>

## Languages of Russia
### Languages of Russia

<div id="geo_russia" class="twocolumn" ></div>

## Other European languages
### Other European languages

<div id="geo_europe" class="twocolumn" ></div>

## Languages in North America
### Languages in North America

<div id="geo_northamerica" class="twocolumn" ></div>

## Languages in Africa
### Languages in Africa

<div id="geo_africa" class="twocolumn" ></div>

## Languages in other parts of the world
### Languages in other parts of the world

<div id="geo_other" class="twocolumn" ></div>

## Languages without geography tag
### Languages without geography tag

<div id="geo_undef" class="twocolumn" ></div>

# Grouped according to language family
## Grouped according to language family

## Uralic Languages
### Uralic Languages

<div id="fam_uralic" class="twocolumn" ></div>

## Eskimo-Aleut Languages
### Eskimo-Aleut Languages

<div id="fam_eskimo_aleut" class="twocolumn" ></div>

## Algic Languages
### Algic Languages

<div id="fam_algic" class="twocolumn" ></div>

## Indoeuropean languages
### Indoeuropean languages

<div id="fam_indoeuropean" class="twocolumn" ></div>

## Niger-Congo Languages
### Niger-Congo Languages

<div id="fam_niger_congo" class="twocolumn" ></div>

## Languages of other language families, isolates, artificial languages
### Languages of other language families, isolates, artificial languages

<div id="fam_other" class="twocolumn" ></div>

## Languages with no language family tag
### Languages with no language family tag

<div id="fam_undef" class="twocolumn" ></div>

Expand Down
Loading

0 comments on commit c9e6738

Please sign in to comment.