Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Bump lingua-language-detector from 1.3.2 to 2.0.1 #107

Closed

Conversation

dependabot[bot]
Copy link
Contributor

@dependabot dependabot bot commented on behalf of github Nov 27, 2023

Bumps lingua-language-detector from 1.3.2 to 2.0.1.

Release notes

Sourced from lingua-language-detector's releases.

Lingua 1.6.1

Bug Fixes

  • The method LanguageDetector.detect_multiple_languages_of returns byte indices. For creating string slices in Python and JavaScript, character indices are needed but were not provided. This resulted in incorrect DetectionResultpemistahl/lingua-py#192

  • Some minor bugs in the WASM module have been fixed to prepare the first release of Lingua for JavaScript.

Lingua 1.6.0

Features

  • Python bindings are now available for the library. These bindings replace the pure Python implementation of Lingua in order to benefit from Rust's performance in any Python software. (#262)

  • Parallel equivalents for all methods in LanguageDetector have been added to give the user the choice of using the library single-threaded or multi-threaded. (#271)

Bug Fixes

  • Several bugs in multiple languages detection have been fixed that caused incomplete results to be returned in several cases.

  • A significant amount of Kazakh texts were incorrectly classified as Mongolian. This has been fixed.

Lingua 1.5.0

Features

  • The new method LanguageDetector.detect_multiple_languages_of() has been introduced. It allows to detect multiple languages in mixed-language text. (#1)

  • The new method LanguageDetectorBuilder.with_low_accuracy_mode() has been introduced. By activating it, detection accuracy for short text is reduced in favor of a smaller memory footprint and faster detection performance. (#119)

  • The new method LanguageDetector.compute_language_confidence() has been introduced. It allows to retrieve the confidence value for one specific language only, given the input text. (#102)

Improvements

  • The computation of the confidence values has been revised and the softmax function is now applied to the values, making them better comparable by behaving more like real probabilities. (#120)

  • The WASM API has been revised. Now it makes use of the same builder pattern as the Rust API. (#122)

  • The language model files are now compressed with the Brotli algorithm which reduces the file size by 15 %, on average. (#189)

  • The language model ngrams are now stored in a CompactString type which reduces the amount of consumed memory by 20 %. (#198)

  • Several performance optimizations have been applied which makes the library nearly twice as fast as the previous version. Big thanks go out to @​serega and @​koute for their help. (#82, #148, #177)

  • The enums IsoCode639_1 and IsoCode639_3 now implement some new traits such as Copy, Hash and Serde's Serialize and Deserialize. The enum Language now implements Copy as well. (#175)

Lingua 1.4.0

Features

  • The library can now be compiled to WebAssembly and be used in any JavaScript project. Big thanks to @​martindisch for bringing this forward. (#14)

Improvements

... (truncated)

Changelog

Sourced from lingua-language-detector's changelog.

Lingua 1.6.1 (released on 23 Nov 2023)

Bug Fixes

  • The method LanguageDetector.detect_multiple_languages_of returns byte indices. For creating string slices in Python and JavaScript, character indices are needed but were not provided. This resulted in incorrect DetectionResults for Python and JavaScript. This has been fixed now by converting the byte indices to character indices.

  • Some minor bugs in the WASM module have been fixed to prepare the first release of Lingua for JavaScript.

Lingua 1.6.0 (released on 14 Nov 2023)

Features

  • Python bindings are now available for the library. These bindings replace the pure Python implementation of Lingua in order to benefit from Rust's performance in any Python software. (#262)

  • Parallel equivalents for all methods in LanguageDetector have been added to give the user the choice of using the library single-threaded or multi-threaded. (#271)

Bug Fixes

  • Several bugs in multiple languages detection have been fixed that caused incomplete results to be returned in several cases.

  • A significant amount of Kazakh texts were incorrectly classified as Mongolian. This has been fixed.

Lingua 1.5.0 (released on 13 Jun 2023)

Features

  • The new method LanguageDetector.detect_multiple_languages_of() has been introduced. It allows to detect multiple languages in mixed-language text. (#1)

  • The new method LanguageDetectorBuilder.with_low_accuracy_mode() has been introduced. By activating it, detection accuracy for short text is reduced in favor of a smaller memory footprint and faster detection performance. (#119)

  • The new method LanguageDetector.compute_language_confidence() has been introduced. It allows to retrieve the confidence value for one specific language only, given the input text. (#102)

Improvements

... (truncated)

Commits

Dependabot compatibility score

Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.


Dependabot commands and options

You can trigger Dependabot actions by commenting on this PR:

  • @dependabot rebase will rebase this PR
  • @dependabot recreate will recreate this PR, overwriting any edits that have been made to it
  • @dependabot merge will merge this PR after your CI passes on it
  • @dependabot squash and merge will squash and merge this PR after your CI passes on it
  • @dependabot cancel merge will cancel a previously requested merge and block automerging
  • @dependabot reopen will reopen this PR if it is closed
  • @dependabot close will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually
  • @dependabot show <dependency name> ignore conditions will show all of the ignore conditions of the specified dependency
  • @dependabot ignore this major version will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)
  • @dependabot ignore this minor version will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)
  • @dependabot ignore this dependency will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)

Bumps [lingua-language-detector](https://github.com/pemistahl/lingua-rs) from 1.3.2 to 2.0.1.
- [Release notes](https://github.com/pemistahl/lingua-rs/releases)
- [Changelog](https://github.com/pemistahl/lingua-rs/blob/main/RELEASE_NOTES.md)
- [Commits](https://github.com/pemistahl/lingua-rs/commits)

---
updated-dependencies:
- dependency-name: lingua-language-detector
  dependency-type: direct:production
  update-type: version-update:semver-major
...

Signed-off-by: dependabot[bot] <[email protected]>
Copy link
Contributor Author

dependabot bot commented on behalf of github Dec 5, 2023

Looks like lingua-language-detector is up-to-date now, so this is no longer needed.

@dependabot dependabot bot closed this Dec 5, 2023
@dependabot dependabot bot deleted the dependabot/pip/lingua-language-detector-2.0.1 branch December 5, 2023 12:54
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

0 participants