Releases: korpling/graphANNIS
3.6.0 - 2025-01-14
Release Notes
Added
- UpdateEvent now implements PartialEq to make it possible to compare changes.
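As a rough illustration of what comparing changes enables, the sketch below uses a simplified, hypothetical stand-in for UpdateEvent (the real enum in graphANNIS has more variants and fields) and an assumed helper named first_difference that locates the first point at which two change lists diverge.

```rust
/// Simplified, hypothetical stand-in for graphANNIS's `UpdateEvent`.
/// The real type has more variants; this only illustrates `PartialEq`.
#[derive(Debug, Clone, PartialEq)]
pub enum UpdateEvent {
    AddNode { node_name: String, node_type: String },
    DeleteNode { node_name: String },
}

/// Hypothetical helper: returns the index of the first position at which
/// two change lists differ, or `None` if both lists are identical.
pub fn first_difference(a: &[UpdateEvent], b: &[UpdateEvent]) -> Option<usize> {
    // Slices of `PartialEq` items can be compared directly.
    if a == b {
        return None;
    }
    let mismatch = a.iter().zip(b.iter()).position(|(x, y)| x != y);
    // If all shared positions match, the lists only differ in length.
    Some(mismatch.unwrap_or_else(|| a.len().min(b.len())))
}
```

With the derive in place, `==` and `assert_eq!` work on single events and on whole lists of events, which is convenient in tests.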
Fixed
- Deserializing a write-ahead log failed because it was located in the wrong
sub-directory and the deserialization routine for the map had a bug.
graphannis-cli 3.6.0
graphannis-webservice 3.6.0
3.5.1 - 2024-09-25
Release Notes
Fixed
- Fixed out-of-bounds error when parsing legacy meta queries with multiple
alternatives (#308)
graphannis-cli 3.5.1
graphannis-webservice 3.5.1
3.4.0 - 2024-08-20
Release Notes
Added
- Added support for coverage edges between span nodes and segmentation nodes when
calculating the AQL model index.
Fixed
- Do not use recursion to calculate the indirect coverage edges in the model
index, since this could fail for deeply nested structures.
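The general technique behind this fix, replacing recursion with an explicit stack, can be sketched as follows. This is not the graphANNIS implementation: the edge map, the u64 node IDs, and the function name covered_nodes are assumptions made for illustration only.

```rust
use std::collections::{HashMap, HashSet};

/// Collects every node reachable from `start` over (hypothetical) coverage
/// edges. An explicit stack on the heap replaces recursion, so deeply
/// nested structures cannot overflow the call stack.
pub fn covered_nodes(edges: &HashMap<u64, Vec<u64>>, start: u64) -> HashSet<u64> {
    let mut visited = HashSet::new();
    let mut stack = vec![start];
    while let Some(node) = stack.pop() {
        if let Some(targets) = edges.get(&node) {
            for &target in targets {
                // `insert` returns false for nodes that were already seen,
                // which also guards against cycles.
                if visited.insert(target) {
                    stack.push(target);
                }
            }
        }
    }
    visited
}
```

The depth of the structure now only affects the size of the heap-allocated stack vector, not the call stack.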
graphannis-cli 3.4.0
graphannis-webservice 3.4.0
3.3.3 - 2024-07-12
Release Notes
Fixed
- Fixed bugs in the relANNIS import that were discovered while testing the
Annatto relANNIS importer.
- Fix FileTooLarge error when searching for token precedence where the
statistics indicate that this search is impossible.
graphannis-cli 3.3.3
graphannis-webservice 3.3.3
3.3.2 - 2024-07-04
Release Notes
Fixed
- Load existing components from the backup folder instead of the actual location
if a backup folder exists.
graphannis-cli 3.3.2
graphannis-webservice 3.3.2
3.3.1 - 2024-06-04
Release Notes
Fixed
- When optional nodes were located not at the end but somewhere in the middle
of the query, the output of the find query could include the wrong node ID.
graphannis-cli 3.3.1
graphannis-webservice 3.3.1
3.3.0 - 2024-05-27
Release Notes
Changed
- Use a TOML file instead of a binary file format to store the global
statistics. You might have to re-import existing corpora or use the
re-optimize command on the command line if the global statistics are
relevant for optimal speed in returning the tokens of a corpus.
Fixed
- Do not reload graph storages when they are already loaded.
- Do not attempt to unload a corpus that is about to be loaded in the next step.
This could trigger problematic unload/load cycles.
- Fixed issues with find_connected, find_connected_inverse and is_connected
and excluded ranges (#257)
- Updated lalrpop dependency to 0.20 to fix warnings reported in newer clippy
versions.
- Fixed compiler warnings in newer Rust versions about unused code.
Added
- Added information about the corpus size to the global statistics and corpus
configuration file. The token/segmentation layer used for the corpus size in
the corpus configuration file corpus-config.toml can be configured manually.
Otherwise, the entries are created automatically during import or when the
re-optimize command is run on the command line. The corpus size is given as
a combination of a unit and the actual quantity. The corpus size unit can be
the number of basic tokens (no outgoing coverage).
[corpus_size]
quantity = 44079
[corpus_size.unit]
name = "tokens"
Or it can describe a specific segmentation layer.
[corpus_size]
quantity = 305056
[corpus_size.unit]
name = "segmentation"
value = "diplomatic"
When the configuration is created automatically, graphANNIS checks whether the
corpus view configuration is set up to use a base_text_segmentation and, if so,
uses this segmentation as the corpus size unit. If a corpus size is already
configured, only the quantity will be updated but not the unit.
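The rule for choosing the unit described above can be sketched as follows. The type and function names (CorpusSize, CorpusSizeUnit, choose_unit, refresh_corpus_size) are hypothetical and only mirror the shape of the TOML entries shown; they are not taken from the graphANNIS API, and the counting step is passed in as a closure because the release notes do not describe how it is implemented.

```rust
/// Hypothetical mirror of the `[corpus_size.unit]` table.
#[derive(Debug, Clone, PartialEq)]
pub enum CorpusSizeUnit {
    /// Basic tokens (no outgoing coverage).
    Tokens,
    /// A named segmentation layer, e.g. "diplomatic".
    Segmentation(String),
}

/// Hypothetical mirror of the `[corpus_size]` table.
#[derive(Debug, Clone, PartialEq)]
pub struct CorpusSize {
    pub quantity: u64,
    pub unit: CorpusSizeUnit,
}

/// An already configured unit always wins; otherwise a configured
/// base text segmentation is used; otherwise basic tokens are counted.
pub fn choose_unit(
    existing: Option<&CorpusSize>,
    base_text_segmentation: Option<&str>,
) -> CorpusSizeUnit {
    match (existing, base_text_segmentation) {
        (Some(size), _) => size.unit.clone(),
        (None, Some(seg)) => CorpusSizeUnit::Segmentation(seg.to_string()),
        (None, None) => CorpusSizeUnit::Tokens,
    }
}

/// Recomputes the quantity for the chosen unit. `count` stands in for
/// whatever counting routine is actually used (an assumption of this sketch).
pub fn refresh_corpus_size(
    existing: Option<&CorpusSize>,
    base_text_segmentation: Option<&str>,
    count: impl Fn(&CorpusSizeUnit) -> u64,
) -> CorpusSize {
    let unit = choose_unit(existing, base_text_segmentation);
    let quantity = count(&unit);
    CorpusSize { quantity, unit }
}
```

Keeping a manually configured unit stable means a later re-optimize run refreshes the number without silently switching what is being counted.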
graphannis-cli 3.3.0
graphannis-webservice 3.3.0
3.2.2 - 2024-04-22
Release Notes
Fixed
- Fix offset and limitation issue when multiple corpora are selected. After a
refactoring, the updated offset was never actually applied when finding the
results in the next corpus. This could lead to too many results on the first
page and also to missing matches on the second and later pages.
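To illustrate the kind of bookkeeping involved, here is a minimal sketch of paging across several corpora. It is not the graphANNIS code: the function find_paged and the representation of each corpus as a pre-computed list of matches are assumptions for illustration. The important step is carrying the reduced offset over to the next corpus.

```rust
/// Returns one page of results from several corpora that are searched in
/// order. `corpora` holds the (hypothetical) full match list per corpus.
pub fn find_paged<T: Clone>(corpora: &[Vec<T>], offset: usize, limit: usize) -> Vec<T> {
    let mut remaining_offset = offset;
    let mut remaining_limit = limit;
    let mut result = Vec::new();
    for matches in corpora {
        if remaining_limit == 0 {
            break;
        }
        // Skip at most as many matches as this corpus contains.
        let skipped = remaining_offset.min(matches.len());
        let page: Vec<T> = matches
            .iter()
            .skip(skipped)
            .take(remaining_limit)
            .cloned()
            .collect();
        // Apply the updated offset and limit before the next corpus.
        // Omitting this step leads to the kind of paging errors described above.
        remaining_offset -= skipped;
        remaining_limit -= page.len();
        result.extend(page);
    }
    result
}
```

For example, with two corpora holding three and two matches, an offset of 2 and a limit of 2 yield the last match of the first corpus followed by the first match of the second.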
graphannis-cli 3.2.2
graphannis-webservice 3.2.2
3.2.1 - 2024-03-25
Release Notes
Fixed
graphannis-cli 3.2.1
graphannis-webservice 3.2.1
3.2.0 - 2024-03-13
Release Notes
Added
- New disk-based graph storage implementation DiskPathV1_D15 that stores the
outgoing paths from every node when maximum branch-out is 1 and the longest
path has the length 15. This is an optimization especially useful for the
PartOf component, since it avoids frequent disk access which would be needed
for an adjacency-based implementation to get all ancestors. Also, PartOf
components are not trees, but still have the property of at most 1 outgoing
edge, which can be used to optimize finding all ancestors. Important: You
cannot downgrade graphANNIS to an older version if you imported a disk-based
corpus with the new version, since old graphANNIS versions won't be able to
load the new graph storage implementation.
- Add new global statistics that describe the combined graph. Until now, there
were only statistics for each graph component and for the node annotation
storage.
- Improved handling of tok queries for corpora with tens of millions of tokens,
by using the newly added graph storage implementation and statistics and
providing an optimized implementation for token search if we already know that
all tokens are part of the default ordering component. This fixes #276.
- Improve performance for regular expression search when using disk-based
annotation storage and the regex has a prefix. This e.g. fixes getting the
text for a document in ANNIS when the corpus is large.
- Improve performance for regular expressions that can be replaced by an exact
value search, even when the value is escaped. This can be useful e.g. in the
subgraph extraction queries from ANNIS, where some characters are escaped with
\x and which was previously not treated as constant value search.
- Improve performance for getting all tokens of a document (e.g. for a subgraph
query) when the PartOf graph storage implementation does not have the same
cost for the inverse graph storage operations, by allowing the use of a nested
loop join in this particular scenario.
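The idea behind the path-based storage in the first item of this list (DiskPathV1_D15) can be sketched in memory as follows. This is only an illustration under stated assumptions: the real implementation is disk-based and its layout is not described here, and the names build_paths and MAX_DEPTH, the u64 node IDs, and the single-successor map are all hypothetical.

```rust
use std::collections::HashMap;

/// Assumed maximum path length, mirroring the "15" in DiskPathV1_D15.
pub const MAX_DEPTH: usize = 15;

/// Given a graph in which every node has at most one outgoing edge
/// (`successor`), stores the complete outgoing path for each source node.
/// Returns `None` if any path exceeds `MAX_DEPTH`, in which case this kind
/// of storage would not be applicable.
pub fn build_paths(successor: &HashMap<u64, u64>) -> Option<HashMap<u64, Vec<u64>>> {
    let mut paths = HashMap::new();
    for &start in successor.keys() {
        let mut path = Vec::new();
        let mut current = start;
        while let Some(&next) = successor.get(&current) {
            // Also terminates on cycles, since they exceed the depth limit.
            if path.len() == MAX_DEPTH {
                return None;
            }
            path.push(next);
            current = next;
        }
        paths.insert(start, path);
    }
    Some(paths)
}
```

With the paths materialized, all ancestors of a node are available in a single lookup instead of one lookup per edge, which is the access pattern the release note describes as costly for adjacency-based storage.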
Fixed
- Do not add "annis:doc" labels to sub-corpora when importing relANNIS corpora.
This fixes queries that search only for documents, e.g. by annis:doc, but
also returned the sub-corpora as results.
- Re-enable adding the C-API shared library as release artifacts to GitHub.
graphannis-cli 3.2.0
graphannis-webservice 3.2.0