Skip to content

Commit

Permalink
Merge branch 'versions/0-6-0'
Browse files Browse the repository at this point in the history
  • Loading branch information
brenes committed Oct 21, 2021
2 parents a9accbb + cc8ad1e commit 9908cc3
Show file tree
Hide file tree
Showing 4 changed files with 66 additions and 20 deletions.
19 changes: 19 additions & 0 deletions CHANGELOG
Original file line number Diff line number Diff line change
@@ -1,3 +1,22 @@
0.6.0:
* Added support for: [@bettysteger] [#16]
* Afrikaans (af)
* Arabic (ar)
* Bengali (bn)
* Breton (br)
* Catalán (ca)
* Czesch (cs)
* Hebrew (he)
* Indonesian (id)
* Korean (ko)
* Thai (th)
* Turkish (tr)
* Vietnamese (vi)
* Added keywords for English and German [@bettysteger] [#16]
* Fixed upcase/downcase behaviour [@bettysteger] [#16]
* Added gender neutral versions of some words in spanish [@fauno] [#17]
* Bumped rdoc version [@dependabot] [#18]
* Fixed Finnish locale code from `fn` to `fi`
0.5.0:
* Added Greek stopwords based on Lucene [@vrypan] [#13]
* Fixed CSV format for sv and ru locales [@woto] [#14]
Expand Down
39 changes: 39 additions & 0 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -107,6 +107,8 @@ That's all?

I know what you're thinking, it takes a line of ruby code to filter one array from other. That's why we have added an extra functionality, [Snowball][wikipedia_snowball] stopwords lists, already built for you and ready to use.

At least, in the beginning we were using snowball stopwords, but several collaborators have improved this humble gem by including new languages or adding new stopwords. So now, the Snowball version is more an "Snowball and friends" version.

How do I use that snowball thing?
---------------------------------

Expand All @@ -122,6 +124,41 @@ And then you filter without worrying about the exact stopwords used
filter.filter 'guide by douglas adams'.split #-> ['guide', 'douglas', 'adams']
```

Which languages are supported with snowball?
-------------------------------------------

Currently we have support for:

* Afrikaans (af)
* Arabic (ar)
* Bengali (bn)
* Breton (br)
* Catalán (ca)
* Czesch (cs)
* Danish (da)
* German (de)
* Greek (el)
* English (en)
* Spanish (es)
* Finnish (fi): Due to an error it can also be used referring to the `fn` locale
* French (fr)
* Hebrew (he)
* Hungarian (hu)
* Indonesian (id)
* Italian (it)
* Korean (ko)
* Dutch (nl)
* Polish (pl)
* Portuguese (pt)
* Romanian (ro)
* Russian (ru)
* Swedish (sv)
* Thai (th)
* Turkish (tr)
* Vietnamese (vi)

In the changelog you can see the collaborators for each language.

Anything else?
--------------

Expand All @@ -132,6 +169,8 @@ Ackonowledgments

Thanks to @s2gatev who added the `stopword?` method and the sieve class to this gem

Thanks to @bettysteger, @fauno, @vrypan, @woto, @grzegorzblaszczyk, @nerde, @sbeckeriv and @zackxu1 for language support and other features.

[wikipedia_stopwords]: http://en.wikipedia.org/wiki/Stopword
[solr]: https://github.com/sunspot/sunspot
[sphinx]: https://github.com/freelancing-god/thinking-sphinx
Expand Down
2 changes: 1 addition & 1 deletion VERSION
Original file line number Diff line number Diff line change
@@ -1 +1 @@
0.5.0
0.6.0
26 changes: 7 additions & 19 deletions stopwords-filter.gemspec
Original file line number Diff line number Diff line change
Expand Up @@ -5,18 +5,22 @@

Gem::Specification.new do |s|
s.name = %q{stopwords-filter}
s.version = "0.5.0"
s.version = "0.6.0"

s.required_rubygems_version = Gem::Requirement.new(">= 0") if s.respond_to? :required_rubygems_version=
s.authors = ["David J. Brenes"]
s.date = %q{2021-06-09}
s.date = %q{2021-10-21}
s.description = %q{Small library that allows you to create a simple stopwords filter or use some based on Snowball stopwords lists}
s.email = %q{[email protected]}
s.extra_rdoc_files = [
"CHANGELOG",
"LICENSE.txt",
"README.md"
]
locale_files = []
Dir.glob("lib/stopwords/snowball/locales/*.csv") do |locale_file|
locale_files << locale_file
end
s.files = [
"CHANGELOG",
"Gemfile",
Expand All @@ -28,26 +32,10 @@ Gem::Specification.new do |s|
"lib/stopwords/snowball.rb",
"lib/stopwords/snowball/filter.rb",
"lib/stopwords/snowball/wordsieve.rb",
"lib/stopwords/snowball/locales/bg.csv",
"lib/stopwords/snowball/locales/da.csv",
"lib/stopwords/snowball/locales/de.csv",
"lib/stopwords/snowball/locales/el.csv",
"lib/stopwords/snowball/locales/en.csv",
"lib/stopwords/snowball/locales/es.csv",
"lib/stopwords/snowball/locales/fn.csv",
"lib/stopwords/snowball/locales/fr.csv",
"lib/stopwords/snowball/locales/hu.csv",
"lib/stopwords/snowball/locales/it.csv",
"lib/stopwords/snowball/locales/nl.csv",
"lib/stopwords/snowball/locales/pl.csv",
"lib/stopwords/snowball/locales/pt.csv",
"lib/stopwords/snowball/locales/ro.csv",
"lib/stopwords/snowball/locales/ru.csv",
"lib/stopwords/snowball/locales/sv.csv",
"spec/lib/filter_spec.rb",
"spec/lib/snowball_filter_spec.rb",
"spec/spec_helper.rb"
]
] + locale_files
s.homepage = %q{http://github.com/brenes/stopwords-filter}
s.licenses = ["MIT"]
s.require_paths = ["lib"]
Expand Down

0 comments on commit 9908cc3

Please sign in to comment.