Basic Architecture
Most pipes provided by EDS-NLP aim to qualify pre-extracted entities. In a nutshell, the basic usage of the library is to:
- Implement a normaliser (see `eds.normalizer`)
- Add an entity recognition component (e.g. the simple but powerful `eds.matcher`)
- Add zero or more entity qualification components, such as `eds.negation`, `eds.family` or `eds.hypothesis`. These qualifiers typically help detect false positives, as in the sketch below.
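A minimal sketch of such a pipeline follows; the component names come from the documentation above, while the matcher terms and example text are purely illustrative, and exact signatures may vary between versions.

```python
import edsnlp
import edsnlp.pipes as eds

nlp = edsnlp.blank("eds")
nlp.add_pipe(eds.normalizer())   # normalisation
nlp.add_pipe(eds.sentences())    # sentence boundaries, used by the qualifiers
nlp.add_pipe(
    eds.matcher(
        terms={"diabete": ["diabete", "diabète", "diabétique", "diabetique"]},
        attr="NORM",
    )
)
nlp.add_pipe(eds.negation())     # qualifiers
nlp.add_pipe(eds.family())
nlp.add_pipe(eds.hypothesis())

doc = nlp("Le patient n'est pas diabétique.")
for ent in doc.ents:
    print(ent.text, ent._.negation, ent._.family, ent._.hypothesis)
```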
Scope
Since the basic usage of EDS-NLP components is to qualify entities, most pipes can function in two modes:
- Annotation of the extracted entities (this is the default). To increase throughput, only pre-extracted entities (found in `doc.ents`) are processed.
- Full-text, token-wise annotation. This mode is activated by setting the `on_ents_only` parameter to `False`.
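As a hedged sketch of the second mode (the example text is illustrative, and the token-level extension name follows the qualifier's documented behaviour):

```python
import edsnlp
import edsnlp.pipes as eds

nlp = edsnlp.blank("eds")
nlp.add_pipe(eds.normalizer())
nlp.add_pipe(eds.sentences())
# Token-wise annotation: qualify every token, not just pre-extracted entities.
nlp.add_pipe(eds.negation(on_ents_only=False))

doc = nlp("Le patient ne présente pas de fièvre.")
print([(token.text, token._.negation) for token in doc])
```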
The possibility of full-text annotation implies that one could use the pipes the other way around, e.g. detecting all negations once and for all in an ETL phase, and reusing the results afterwards. However, this is not the intended use of the library, which aims to help researchers downstream as a standalone application.
Result persistence
Depending on their purpose (entity extraction, qualification, etc.), EDS-NLP pipes write their results to `Doc.ents`, `Doc.spans` or to a custom attribute.
Extraction pipes
Extraction pipes (matchers, the date detector or NER pipes, for instance) write their results directly to the `Doc.ents` attribute.
Note that spaCy prohibits overlapping entities within the `Doc.ents` attribute. To circumvent this limitation, we filter spans and keep all discarded entities within the `discarded` key of the `Doc.spans` attribute.
Some pipes write their output to the `Doc.spans` dictionary. We enforce the following doctrine:
- Should the pipe extract entities that are directly informative (typically the output of the `eds.matcher` component), said entities are stashed in the `Doc.ents` attribute.
- On the other hand, should the entity be useful to another pipe but less so in itself (e.g. the output of the `eds.sections` or `eds.dates` components), it will be stashed in a specific key within the `Doc.spans` attribute.
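A hedged illustration of where each kind of output ends up (the matcher terms, note text and span-group keys follow the conventions described above):

```python
import edsnlp
import edsnlp.pipes as eds

nlp = edsnlp.blank("eds")
nlp.add_pipe(eds.normalizer())
nlp.add_pipe(eds.matcher(terms={"diabete": ["diabete", "diabète"]}, attr="NORM"))
nlp.add_pipe(eds.sections())
nlp.add_pipe(eds.dates())

doc = nlp("Antécédents : diabète diagnostiqué en mars 2012.")

print(doc.ents)               # directly informative entities (eds.matcher)
print(doc.spans["sections"])  # contextual spans (eds.sections)
print(doc.spans["dates"])     # contextual spans (eds.dates)
```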
Entity tagging
Moreover, most pipes declare spaCy extensions on the `Doc`, `Span` and/or `Token` objects.
These extensions are especially useful for qualifier pipes, but can also be used by other pipes to persist relevant information. For instance, the `eds.dates` pipeline component:
- Populates `Doc.spans["dates"]`
- For each detected item, keeps the normalised date in `Span._.date`
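A minimal sketch of reading that extension back (the example date is illustrative):

```python
import edsnlp
import edsnlp.pipes as eds

nlp = edsnlp.blank("eds")
nlp.add_pipe(eds.dates())

doc = nlp("Consultation du 12 janvier 2023.")
for date in doc.spans["dates"]:
    print(date.text, date._.date)  # the normalised date kept by the component
```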
Core Components
This section deals with the "core" functionalities offered by EDS-NLP:
- Generic matching against regular expressions and lists of terms
- Text cleaning
- Sentence boundary detection
Available components
Component | Description |
---|---|
eds.normalizer | Non-destructive input text normalisation |
eds.sentences | Better sentence boundary detection |
eds.matcher | A simple yet powerful entity extractor |
eds.terminology | A simple yet powerful terminology matcher |
eds.contextual_matcher | A conditional entity extractor |
eds.endlines | An unsupervised model to classify each end line |
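As a hedged sketch, these core components are typically combined as follows (the terms, regular expression and example text are illustrative):

```python
import edsnlp
import edsnlp.pipes as eds

nlp = edsnlp.blank("eds")
nlp.add_pipe(eds.normalizer())   # non-destructive text normalisation
nlp.add_pipe(eds.sentences())    # sentence boundary detection
nlp.add_pipe(
    eds.matcher(
        terms={"covid": ["covid", "coronavirus"]},
        regex={"respiratoire": r"respiratoires?"},
        attr="NORM",             # match on the normalised text
    )
)

doc = nlp("Détresse respiratoire aiguë sur COVID.")
print(doc.ents)
```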
Miscellaneous
This section regroups components that extract information which can be used by other components, but which has little medical value in itself.
For instance, the date detection and normalisation pipeline falls into this category.
Available components
Component | Description |
---|---|
eds.dates | Date extraction and normalisation |
eds.consultation_dates | Identify consultation dates |
eds.quantities | Quantity extraction and normalisation |
eds.sections | Section detection |
eds.reason | Rule-based hospitalisation reason detection |
eds.tables | Tables detection |
eds.split | Doc splitting |
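For instance, a hedged sketch of the section detector (the note text is illustrative):

```python
import edsnlp
import edsnlp.pipes as eds

nlp = edsnlp.blank("eds")
nlp.add_pipe(eds.normalizer())
nlp.add_pipe(eds.sections())

doc = nlp("Antécédents : HTA.\nConclusion : retour à domicile.")
for section in doc.spans["sections"]:
    print(section.label_, repr(section.text))
```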
Named Entity Recognition Components
We provide several Named Entity Recognition (NER) components. Named Entity Recognition is the task of identifying short relevant spans of text, named entities, and classifying them into pre-defined categories. In the case of clinical documents, these entities can be scores, disorders, behaviors, codes, dates, quantities, etc.
Span setters: where are extracted entities stored?
A component assigns entities to a document by adding them to the `doc.ents` or `doc.spans[group]` attributes. `doc.ents` only supports non-overlapping entities: if two entities overlap, the longest one will be kept. `doc.spans[group]`, on the other hand, can contain overlapping entities. To control where entities are added, you can use the `span_setter` argument of any of these components.
Valid values for the `span_setter` argument of a component are:
- a `(doc, matches) -> None` callable
- a span group name
- a list of span group names
- a dict mapping group names to `True` or to a list of labels
The group name `"ents"` is a special case, and will add the matches to `doc.ents`.
Examples
- `span_setter=["ents", "ckd"]` will add the matches to both `doc.ents` and `doc.spans["ckd"]`. It is equivalent to `{"ents": True, "ckd": True}`.
- `span_setter={"ents": ["foo", "bar"]}` will add the matches with labels "foo" and "bar" to `doc.ents`.
- `span_setter="ents"` will add all matches only to `doc.ents`.
- `span_setter="ckd"` will add all matches only to `doc.spans["ckd"]`.
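As a hedged sketch, this is how the argument can be passed to one of the NER components listed below (the example text is illustrative):

```python
import edsnlp
import edsnlp.pipes as eds

nlp = edsnlp.blank("eds")
# Store COVID mentions both in doc.ents and in a dedicated span group.
nlp.add_pipe(eds.covid(span_setter=["ents", "covid"]))

doc = nlp("Patient admis pour suspicion de COVID-19.")
print(doc.ents, doc.spans["covid"])
```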
Available components
Component | Description |
---|---|
eds.covid | A COVID mentions detector |
eds.charlson | A Charlson score extractor |
eds.sofa | A SOFA score extractor |
eds.elston_ellis | An Elston & Ellis code extractor |
eds.emergency_priority | A priority score extractor |
eds.emergency_ccmu | A CCMU score extractor |
eds.emergency_gemsa | A GEMSA score extractor |
eds.tnm | A TNM score extractor |
eds.adicap | An ADICAP code extractor |
eds.drugs | A drug mentions extractor |
eds.cim10 | A CIM10 terminology matcher |
eds.umls | A UMLS terminology matcher |
eds.ckd | CKD extractor |
eds.copd | COPD extractor |
eds.cerebrovascular_accident | Cerebrovascular accident extractor |
eds.congestive_heart_failure | Congestive heart failure extractor |
eds.connective_tissue_disease | Connective tissue disease extractor |
eds.dementia | Dementia extractor |
eds.diabetes | Diabetes extractor |
eds.hemiplegia | Hemiplegia extractor |
eds.leukemia | Leukemia extractor |
eds.liver_disease | Liver disease extractor |
eds.lymphoma | Lymphoma extractor |
eds.myocardial_infarction | Myocardial infarction extractor |
eds.peptic_ulcer_disease | Peptic ulcer disease extractor |
eds.peripheral_vascular_disease | Peripheral vascular disease extractor |
eds.solid_tumor | Solid tumor extractor |
eds.alcohol | Alcohol consumption extractor |
eds.tobacco | Tobacco consumption extractor |
Span Pooler
The `eds.span_pooler` component is a trainable span embedding component. It generates span embeddings from a word embedding component and a span getter. It can be used to train a span classifier, as in `eds.span_classifier`.
Parameters
PARAMETER | DESCRIPTION |
---|---|
nlp | The pipeline object |
name | Name of the component |
embedding | The word embedding component |
pooling_mode | How word embeddings are aggregated into a single embedding per span |
hidden_size | The size of the hidden layer. If None, no projection is done and the output of the span pooler is used directly. |
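A hedged configuration sketch nesting the pooler inside a trainable span qualifier; the transformer model name, span group and qualified attribute are illustrative, and the exact `eds.span_classifier` signature may differ between versions.

```python
import edsnlp
import edsnlp.pipes as eds

nlp = edsnlp.blank("eds")
nlp.add_pipe(
    eds.span_classifier(
        embedding=eds.span_pooler(
            embedding=eds.transformer(
                model="prajjwal1/bert-tiny",  # illustrative model name
                window=128,
                stride=96,
            ),
            pooling_mode="mean",
        ),
        span_getter=["ents"],
        attributes=["_.negation"],  # assumed attribute to classify
    ),
    name="qualifier",
)
```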
Text CNN
The `eds.text_cnn` component is a simple 1D convolutional network used to contextualize word embeddings (as computed by the `embedding` component passed as argument).
To be memory efficient when handling batches of variable-length sequences, this module employs sequence packing, while taking care to avoid contamination between the different docs.
Parameters
PARAMETER | DESCRIPTION |
---|---|
nlp | The pipeline object |
name | The name of the component |
embedding | Embedding module to apply to the input |
output_size | Size of the output embeddings. Defaults to the embedding size. |
out_channels | Number of channels |
kernel_sizes | Window size of each kernel |
activation | Activation function to use |
residual | Whether to use residual connections |
normalize | Whether to normalize before or after the residual connection |
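A hedged sketch of plugging the CNN between a transformer and a trainable NER head; the model name and the `eds.ner_crf` settings (including the span group holding the training annotations) are illustrative and may differ between versions.

```python
import edsnlp
import edsnlp.pipes as eds

nlp = edsnlp.blank("eds")
nlp.add_pipe(
    eds.ner_crf(
        embedding=eds.text_cnn(
            embedding=eds.transformer(
                model="prajjwal1/bert-tiny",  # illustrative model name
                window=128,
                stride=96,
            ),
            kernel_sizes=(3, 4, 5),
            residual=True,
        ),
        mode="joint",
        target_span_getter="gold-ner",  # assumed span group with gold entities
    ),
    name="ner",
)
```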
Trainable components overview
In addition to its rule-based pipeline components, EDS-NLP offers new trainable components to fit and run machine learning models for classic biomedical information extraction tasks.
All trainable components implement the `TorchComponent` class, which provides a common API for training and inference.
Available components:
Name | Description |
---|---|
eds.transformer | Embed text with a transformer model |
eds.text_cnn | Contextualize embeddings with a CNN |
eds.span_pooler | A span embedding component that aggregates word embeddings |
eds.ner_crf | A trainable component to extract entities |
eds.span_classifier | A trainable component for multi-class multi-label span classification |
eds.span_linker | A trainable entity linker (i.e. to a list of concepts) |
Tutorials
We provide step-by-step guides to get you started. We cover the following use-cases:
Spacy representations
Learn the basics of how documents are represented with spaCy.
Matching a terminology
Extract phrases that belong to a given terminology.
Qualifying entities
Ensure extracted concepts are not invalidated by linguistic modulation.
Detecting dates
Detect and parse dates in a text.
Processing multiple texts
Improve the inference speed of your pipeline.
Detecting hospitalisation reason
Identify spans mentioning the reason for hospitalisation or tag entities as the reason.
Detecting false endlines
Classify each line ending and add the `excluded` attribute to these tokens.
Aggregating results
Aggregate the results of your pipeline at the document level.
FastAPI
Deploy your pipeline as an API.
Visualization
Quickly visualize the results of your pipeline as annotations or tables.
Deep learning tutorial
Learn how EDS-NLP handles training deep-neural networks.
Training API
Learn how to quickly train a deep-learning model with `edsnlp.train`.
Overview of connectors
EDS-NLP provides a series of connectors to convert back and forth between different formats and the spaCy representation.
We provide the following connectors:
Pipeline evaluation
Utilities
EDS-NLP provides a few utilities to deploy pipelines, process RegExps, etc.
Work with RegExp
Tests Utilities
We provide a few testing utilities that simplify the process of:
- creating testing examples for NLP pipelines;
- testing documentation code blocks.