diff --git a/previews/PR4/index.html b/previews/PR4/index.html
index 797a381..3ec40f4 100644
--- a/previews/PR4/index.html
+++ b/previews/PR4/index.html
@@ -78,7 +78,7 @@
     <div data-md-component="skip">
       
         
-        <a href="#what-is-tidierfilesjl" class="md-skip">
+        <a href="#tidierfilesjl" class="md-skip">
           Skip to content
         </a>
       
@@ -274,6 +274,8 @@
       <input class="md-nav__toggle md-toggle" type="checkbox" id="__toc">
       
       
+        
+      
       
         <label class="md-nav__link md-nav__link--active" for="__toc">
           
@@ -302,6 +304,8 @@
   
   
   
+    
+  
   
     <label class="md-nav__title" for="__toc">
       <span class="md-nav__icon md-icon"></span>
@@ -403,6 +407,8 @@
   
   
   
+    
+  
   
     <label class="md-nav__title" for="__toc">
       <span class="md-nav__icon md-icon"></span>
@@ -434,16 +440,15 @@
                   
 
 
-  <h1>Home</h1>
-
-<div><p><a id="What-is-TidierFiles.jl?"></a></p>
+<div><p><a id="TidierFiles.jl"></a></p>
+<p><a id="TidierFiles.jl-1"></a></p>
+<h1 id="tidierfilesjl">TidierFiles.jl<a class="headerlink" href="#tidierfilesjl" title="Permanent link">¤</a></h1>
+<p><a id="What-is-TidierFiles.jl?"></a></p>
 <p><a id="What-is-TidierFiles.jl?-1"></a></p>
 <h2 id="what-is-tidierfilesjl">What is TidierFiles.jl?<a class="headerlink" href="#what-is-tidierfilesjl" title="Permanent link">¤</a></h2>
-<p>TidierFiles.jl is a 100% Julia implementation of the readr and haven R packages. Powered by the CSV.jl, XLSX.jl and ReadStatTables.jl packages, TidierFiles.jl  seeks to harmonize file reading/writing by unifying the arguments across multiple  file types. </p>
-<p>TidierFiles.jl currently supports </p>
-<div class="admonition example">
-<p class="admonition-title">Example</p>
-</div>
+<p>TidierFiles.jl is a 100% Julia implementation of the readr, haven, readxl, and writexl R packages.</p>
+<p>Powered by the CSV.jl, XLSX.jl and ReadStatTables.jl packages, TidierFiles.jl aims to bring a consistent interface to the reading and writing of tabular data, including a consistent syntax to read files locally versus from the web and consistent keyword arguments across data formats.</p>
+<p>Currently supported file types:</p>
 <ul>
 <li><code>read_csv</code> and <code>write_csv</code></li>
 <li><code>read_tsv</code> and <code>write_tsv</code></li>
@@ -453,9 +458,36 @@ <h2 id="what-is-tidierfilesjl">What is TidierFiles.jl?<a class="headerlink" href
 <li><code>read_fwf</code> and <code>fwf_empty</code></li>
 <li><code>read_sav</code> and <code>write_sav</code> (.sav and .por)</li>
 <li><code>read_sas</code> and <code>write_sas</code> (.sas7bdat and .xpt)</li>
-<li><code>read_dta</code> and <code>write_dta</code> (.dta) </li>
+<li><code>read_dta</code> and <code>write_dta</code> (.dta)</li>
 </ul>
-<p>Read functions include the following arguments and support HTTP reading:</p>
+<p><a id="Examples"></a></p>
+<p><a id="Examples-1"></a></p>
+<h1 id="examples">Examples<a class="headerlink" href="#examples" title="Permanent link">¤</a></h1>
+<p>Here is an example of how to write and read a CSV file.</p>
+<div class="highlight"><pre><span></span><code><span class="k">using</span><span class="w"> </span><span class="n">TidierFiles</span>
+
+<span class="n">df</span><span class="w"> </span><span class="o">=</span><span class="w"> </span><span class="n">DataFrame</span><span class="p">(</span>
+<span class="w">       </span><span class="n">integers</span><span class="w"> </span><span class="o">=</span><span class="w"> </span><span class="p">[</span><span class="mi">1</span><span class="p">,</span><span class="w"> </span><span class="mi">2</span><span class="p">,</span><span class="w"> </span><span class="mi">3</span><span class="p">,</span><span class="w"> </span><span class="mi">4</span><span class="p">],</span>
+<span class="w">       </span><span class="n">strings</span><span class="w"> </span><span class="o">=</span><span class="w"> </span><span class="p">[</span><span class="s">"This"</span><span class="p">,</span><span class="w"> </span><span class="s">"Package makes"</span><span class="p">,</span><span class="w"> </span><span class="s">"File reading/writing"</span><span class="p">,</span><span class="w"> </span><span class="s">"even smoother"</span><span class="p">],</span>
+<span class="w">       </span><span class="n">floats</span><span class="w"> </span><span class="o">=</span><span class="w"> </span><span class="p">[</span><span class="mf">10.2</span><span class="p">,</span><span class="w"> </span><span class="mf">20.3</span><span class="p">,</span><span class="w"> </span><span class="mf">30.4</span><span class="p">,</span><span class="w"> </span><span class="mf">40.5</span><span class="p">],</span>
+<span class="w">       </span><span class="n">dates</span><span class="w"> </span><span class="o">=</span><span class="w"> </span><span class="p">[</span><span class="n">Date</span><span class="p">(</span><span class="mi">2018</span><span class="p">,</span><span class="mi">2</span><span class="p">,</span><span class="mi">20</span><span class="p">),</span><span class="w"> </span><span class="n">Date</span><span class="p">(</span><span class="mi">2018</span><span class="p">,</span><span class="mi">2</span><span class="p">,</span><span class="mi">21</span><span class="p">),</span><span class="w"> </span><span class="n">Date</span><span class="p">(</span><span class="mi">2018</span><span class="p">,</span><span class="mi">2</span><span class="p">,</span><span class="mi">22</span><span class="p">),</span><span class="w"> </span><span class="n">Date</span><span class="p">(</span><span class="mi">2018</span><span class="p">,</span><span class="mi">2</span><span class="p">,</span><span class="mi">23</span><span class="p">)],</span>
+<span class="w">       </span><span class="n">times</span><span class="w"> </span><span class="o">=</span><span class="w"> </span><span class="p">[</span><span class="n">Dates</span><span class="o">.</span><span class="n">Time</span><span class="p">(</span><span class="mi">19</span><span class="p">,</span><span class="mi">10</span><span class="p">),</span><span class="w"> </span><span class="n">Dates</span><span class="o">.</span><span class="n">Time</span><span class="p">(</span><span class="mi">19</span><span class="p">,</span><span class="mi">20</span><span class="p">),</span><span class="w"> </span><span class="n">Dates</span><span class="o">.</span><span class="n">Time</span><span class="p">(</span><span class="mi">19</span><span class="p">,</span><span class="mi">30</span><span class="p">),</span><span class="w"> </span><span class="n">Dates</span><span class="o">.</span><span class="n">Time</span><span class="p">(</span><span class="mi">19</span><span class="p">,</span><span class="mi">40</span><span class="p">)]</span>
+<span class="w">     </span><span class="p">)</span>
+
+<span class="n">write_csv</span><span class="p">(</span><span class="n">df</span><span class="p">,</span><span class="w"> </span><span class="s">"testing.csv"</span><span class="w"> </span><span class="p">,</span><span class="w"> </span><span class="n">col_names</span><span class="w"> </span><span class="o">=</span><span class="w"> </span><span class="nb">true</span><span class="p">)</span>
+
+<span class="n">read_csv</span><span class="p">(</span><span class="s">"testing.csv"</span><span class="p">,</span><span class="w"> </span><span class="n">missingstring</span><span class="o">=</span><span class="p">[</span><span class="s">"40.5"</span><span class="p">,</span><span class="w"> </span><span class="s">"10.2"</span><span class="p">])</span>
+</code></pre></div>
+<div class="highlight"><pre><span></span><code>4×5 DataFrame
+ Row │ integers  strings               floats     dates       times    
+     │ Int64     String31              Float64?   Date        Time     
+─────┼─────────────────────────────────────────────────────────────────
+   1 │        1  This                  missing    2018-02-20  19:10:00
+   2 │        2  Package makes              20.3  2018-02-21  19:20:00
+   3 │        3  File reading/writing       30.4  2018-02-22  19:30:00
+   4 │        4  even smoother         missing    2018-02-23  19:40:00:00
+</code></pre></div>
+<p>The file reading functions include the following keyword arguments:</p>
 <ul>
 <li><code>path</code></li>
 <li><code>missingstring</code></li>
@@ -464,11 +496,10 @@ <h2 id="what-is-tidierfilesjl">What is TidierFiles.jl?<a class="headerlink" href
 <li><code>num_threads</code></li>
 <li><code>skip</code></li>
 <li><code>n_max</code></li>
-<li><code>delim</code> (where applies)</li>
+<li><code>delim</code> (where applicable)</li>
 </ul>
-<div class="highlight"><pre><span></span><code><span class="k">using</span><span class="w"> </span><span class="n">TidierFiles</span>
-
-<span class="n">read_csv</span><span class="p">(</span><span class="s">"https://raw.githubusercontent.com/TidierOrg/TidierFiles.jl/main/testing_files/csvtest.csv"</span><span class="p">,</span><span class="w"> </span><span class="n">skip</span><span class="w"> </span><span class="o">=</span><span class="w"> </span><span class="mi">2</span><span class="p">,</span><span class="w"> </span><span class="n">n_max</span><span class="w"> </span><span class="o">=</span><span class="w"> </span><span class="mi">3</span><span class="p">,</span><span class="w"> </span><span class="n">col_select</span><span class="w"> </span><span class="o">=</span><span class="w"> </span><span class="p">[</span><span class="s">"ID"</span><span class="p">,</span><span class="w"> </span><span class="s">"Score"</span><span class="p">],</span><span class="w"> </span><span class="n">missingstring</span><span class="w"> </span><span class="o">=</span><span class="w"> </span><span class="p">[</span><span class="s">"4"</span><span class="p">])</span>
+<p>The path can be a file available either locally or on the web.</p>
+<div class="highlight"><pre><span></span><code><span class="n">read_csv</span><span class="p">(</span><span class="s">"https://raw.githubusercontent.com/TidierOrg/TidierFiles.jl/main/testing_files/csvtest.csv"</span><span class="p">,</span><span class="w"> </span><span class="n">skip</span><span class="w"> </span><span class="o">=</span><span class="w"> </span><span class="mi">2</span><span class="p">,</span><span class="w"> </span><span class="n">n_max</span><span class="w"> </span><span class="o">=</span><span class="w"> </span><span class="mi">3</span><span class="p">,</span><span class="w"> </span><span class="n">col_select</span><span class="w"> </span><span class="o">=</span><span class="w"> </span><span class="p">[</span><span class="s">"ID"</span><span class="p">,</span><span class="w"> </span><span class="s">"Score"</span><span class="p">],</span><span class="w"> </span><span class="n">missingstring</span><span class="w"> </span><span class="o">=</span><span class="w"> </span><span class="p">[</span><span class="s">"4"</span><span class="p">])</span>
 </code></pre></div>
 <div class="highlight"><pre><span></span><code>3×2 DataFrame
  Row │ ID       Score 
diff --git a/previews/PR4/reference/index.html b/previews/PR4/reference/index.html
index 93f76c0..c1e2b5d 100644
--- a/previews/PR4/reference/index.html
+++ b/previews/PR4/reference/index.html
@@ -480,7 +480,7 @@ <h2 id="reference-exported-functions">Reference - Exported functions<a class="he
 <span class="gp">julia&gt;</span><span class="w"> </span><span class="n">fwf_empty</span><span class="p">(</span><span class="n">path</span><span class="p">,</span><span class="w"> </span><span class="n">num_lines</span><span class="o">=</span><span class="mi">4</span><span class="p">,</span><span class="w"> </span><span class="n">col_names</span><span class="w"> </span><span class="o">=</span><span class="w"> </span><span class="p">[</span><span class="s">"Name"</span><span class="p">,</span><span class="w"> </span><span class="s">"Age"</span><span class="p">,</span><span class="w"> </span><span class="s">"ID"</span><span class="p">,</span><span class="w"> </span><span class="s">"Position"</span><span class="p">,</span><span class="w"> </span><span class="s">"Salary"</span><span class="p">])</span>
 <span class="go">([13, 5, 8, 20, 8], ["Name", "Age", "ID", "Position", "Salary"])</span>
 </code></pre></div>
-<p><a target="_blank" href="https://github.com/TidierOrg/TidierFiles.jl/blob/aa1c800b94c409bbf769f1c03b766b8e9c7c9a75/src/fwf.jl#L62-L92" class="documenter-source">source</a><br></p>
+<p><a target="_blank" href="https://github.com/TidierOrg/TidierFiles.jl/blob/9ae1770b5773c006e1c812dadd54af45e1eaefbd/src/fwf.jl#L62-L92" class="documenter-source">source</a><br></p>
 <p><a id="TidierFiles.read_csv-Tuple{Any}" href="#TidierFiles.read_csv-Tuple{Any}">#</a>
 <strong><code>TidierFiles.read_csv</code></strong> — <em>Method</em>.</p>
 <div class="highlight"><pre><span></span><code><span class="n">read_csv</span><span class="p">(</span><span class="n">file</span><span class="p">;</span><span class="w"> </span><span class="n">delim</span><span class="o">=</span><span class="sc">','</span><span class="p">,</span><span class="n">col_names</span><span class="o">=</span><span class="nb">true</span><span class="p">,</span><span class="w"> </span><span class="n">skip</span><span class="o">=</span><span class="mi">0</span><span class="p">,</span><span class="w"> </span><span class="n">n_max</span><span class="o">=</span><span class="nb">Inf</span><span class="p">,</span><span class="w"> </span>
@@ -503,7 +503,7 @@ <h2 id="reference-exported-functions">Reference - Exported functions<a class="he
 <span class="go">   2 │     4  David         85</span>
 <span class="go">   3 │     5  Eva      missing </span>
 </code></pre></div>
-<p><a target="_blank" href="https://github.com/TidierOrg/TidierFiles.jl/blob/aa1c800b94c409bbf769f1c03b766b8e9c7c9a75/src/TidierFiles.jl#L23-L55" class="documenter-source">source</a><br></p>
+<p><a target="_blank" href="https://github.com/TidierOrg/TidierFiles.jl/blob/9ae1770b5773c006e1c812dadd54af45e1eaefbd/src/TidierFiles.jl#L23-L55" class="documenter-source">source</a><br></p>
 <p><a id="TidierFiles.read_delim-Tuple{Any}" href="#TidierFiles.read_delim-Tuple{Any}">#</a>
 <strong><code>TidierFiles.read_delim</code></strong> — <em>Method</em>.</p>
 <div class="highlight"><pre><span></span><code><span class="n">read_delim</span><span class="p">(</span><span class="n">file</span><span class="p">;</span><span class="w"> </span><span class="n">delim</span><span class="o">=</span><span class="err">'</span><span class="w">    </span><span class="err">'</span><span class="p">,</span><span class="n">col_names</span><span class="o">=</span><span class="nb">true</span><span class="p">,</span><span class="w"> </span><span class="n">skip</span><span class="o">=</span><span class="mi">0</span><span class="p">,</span><span class="w"> </span><span class="n">n_max</span><span class="o">=</span><span class="nb">Inf</span><span class="p">,</span><span class="w"> </span>
@@ -529,7 +529,7 @@ <h2 id="reference-exported-functions">Reference - Exported functions<a class="he
 <span class="go">   5 │ 4        David    85</span>
 <span class="go">   6 │ 5        Eva      95</span>
 </code></pre></div>
-<p><a target="_blank" href="https://github.com/TidierOrg/TidierFiles.jl/blob/aa1c800b94c409bbf769f1c03b766b8e9c7c9a75/src/TidierFiles.jl#L87-L124" class="documenter-source">source</a><br></p>
+<p><a target="_blank" href="https://github.com/TidierOrg/TidierFiles.jl/blob/9ae1770b5773c006e1c812dadd54af45e1eaefbd/src/TidierFiles.jl#L87-L124" class="documenter-source">source</a><br></p>
 <p><a id="TidierFiles.read_dta-Tuple{Any}" href="#TidierFiles.read_dta-Tuple{Any}">#</a>
 <strong><code>TidierFiles.read_dta</code></strong> — <em>Method</em>.</p>
 <div class="highlight"><pre><span></span><code><span class="k">function</span><span class="w"> </span><span class="n">read_dta</span><span class="p">(</span><span class="n">data_file</span><span class="p">;</span><span class="w">  </span><span class="n">encoding</span><span class="o">=</span><span class="nb">nothing</span><span class="p">,</span><span class="w"> </span><span class="n">col_select</span><span class="o">=</span><span class="nb">nothing</span><span class="p">,</span><span class="w"> </span><span class="n">skip</span><span class="o">=</span><span class="mi">0</span><span class="p">,</span><span class="w"> </span><span class="n">n_max</span><span class="o">=</span><span class="nb">Inf</span><span class="p">)</span>
@@ -550,7 +550,7 @@ <h2 id="reference-exported-functions">Reference - Exported functions<a class="he
 <span class="go">   1 │ sav         10.1</span>
 <span class="go">   2 │ por         10.2</span>
 </code></pre></div>
-<p><a target="_blank" href="https://github.com/TidierOrg/TidierFiles.jl/blob/aa1c800b94c409bbf769f1c03b766b8e9c7c9a75/src/statsfiles.jl#L91-L118" class="documenter-source">source</a><br></p>
+<p><a target="_blank" href="https://github.com/TidierOrg/TidierFiles.jl/blob/9ae1770b5773c006e1c812dadd54af45e1eaefbd/src/statsfiles.jl#L91-L118" class="documenter-source">source</a><br></p>
 <p><a id="TidierFiles.read_fwf-Tuple{String, Tuple{Vector{Int64}, Union{Nothing, Vector{String}}}}" href="#TidierFiles.read_fwf-Tuple{String,%20Tuple{Vector{Int64},%20Union{Nothing,%20Vector{String}}}}">#</a>
 <strong><code>TidierFiles.read_fwf</code></strong> — <em>Method</em>.</p>
 <div class="highlight"><pre><span></span><code><span class="n">read_fwf</span><span class="p">(</span><span class="n">filepath</span><span class="o">::</span><span class="kt">String</span><span class="p">;</span><span class="w"> </span><span class="n">num_lines</span><span class="o">::</span><span class="kt">Int</span><span class="o">=</span><span class="mi">4</span><span class="p">,</span><span class="w"> </span><span class="n">col_names</span><span class="o">=</span><span class="nb">nothing</span><span class="p">)</span>
@@ -582,7 +582,7 @@ <h2 id="reference-exported-functions">Reference - Exported functions<a class="he
 <span class="go">   2 │ Charlie Day  28      345     Sales Associate  70,000</span>
 <span class="go">   3 │ Diane Poe    35      23456   Data Scientist   130,000</span>
 </code></pre></div>
-<p><a target="_blank" href="https://github.com/TidierOrg/TidierFiles.jl/blob/aa1c800b94c409bbf769f1c03b766b8e9c7c9a75/src/fwf.jl#L11-L42" class="documenter-source">source</a><br></p>
+<p><a target="_blank" href="https://github.com/TidierOrg/TidierFiles.jl/blob/9ae1770b5773c006e1c812dadd54af45e1eaefbd/src/fwf.jl#L11-L42" class="documenter-source">source</a><br></p>
 <p><a id="TidierFiles.read_sas-Tuple{Any}" href="#TidierFiles.read_sas-Tuple{Any}">#</a>
 <strong><code>TidierFiles.read_sas</code></strong> — <em>Method</em>.</p>
 <div class="highlight"><pre><span></span><code><span class="k">function</span><span class="w"> </span><span class="n">read_sas</span><span class="p">(</span><span class="n">data_file</span><span class="p">;</span><span class="w">  </span><span class="n">encoding</span><span class="o">=</span><span class="nb">nothing</span><span class="p">,</span><span class="w"> </span><span class="n">col_select</span><span class="o">=</span><span class="nb">nothing</span><span class="p">,</span><span class="w"> </span><span class="n">skip</span><span class="o">=</span><span class="mi">0</span><span class="p">,</span><span class="w"> </span><span class="n">n_max</span><span class="o">=</span><span class="nb">Inf</span><span class="p">,</span><span class="w"> </span><span class="n">num_threads</span><span class="p">)</span>
@@ -596,7 +596,7 @@ <h2 id="reference-exported-functions">Reference - Exported functions<a class="he
 <p>julia&gt; read_sas("test.sas7bdat") 2×2 DataFrame  Row │ AA       AB            │ String3  Float64  ─────┼──────────────────    1 │ sav         10.1    2 │ por         10.2</p>
 <p>julia&gt; write_sas(df, "test.xpt");</p>
 <p>julia&gt; read_sas("test.xpt") 2×2 DataFrame  Row │ AA       AB            │ String3  Float64  ─────┼──────────────────    1 │ sav         10.1    2 │ por         10.2</p>
-<p><a target="_blank" href="https://github.com/TidierOrg/TidierFiles.jl/blob/aa1c800b94c409bbf769f1c03b766b8e9c7c9a75/src/statsfiles.jl#L1-L36" class="documenter-source">source</a><br></p>
+<p><a target="_blank" href="https://github.com/TidierOrg/TidierFiles.jl/blob/9ae1770b5773c006e1c812dadd54af45e1eaefbd/src/statsfiles.jl#L1-L36" class="documenter-source">source</a><br></p>
 <p><a id="TidierFiles.read_sav-Tuple{Any}" href="#TidierFiles.read_sav-Tuple{Any}">#</a>
 <strong><code>TidierFiles.read_sav</code></strong> — <em>Method</em>.</p>
 <div class="highlight"><pre><span></span><code><span class="k">function</span><span class="w"> </span><span class="n">read_sav</span><span class="p">(</span><span class="n">data_file</span><span class="p">;</span><span class="w">  </span><span class="n">encoding</span><span class="o">=</span><span class="nb">nothing</span><span class="p">,</span><span class="w"> </span><span class="n">col_select</span><span class="o">=</span><span class="nb">nothing</span><span class="p">,</span><span class="w"> </span><span class="n">skip</span><span class="o">=</span><span class="mi">0</span><span class="p">,</span><span class="w"> </span><span class="n">n_max</span><span class="o">=</span><span class="nb">Inf</span><span class="p">)</span>
@@ -627,7 +627,7 @@ <h2 id="reference-exported-functions">Reference - Exported functions<a class="he
 <span class="go">   1 │ sav        10.1</span>
 <span class="go">   2 │ por        10.2</span>
 </code></pre></div>
-<p><a target="_blank" href="https://github.com/TidierOrg/TidierFiles.jl/blob/aa1c800b94c409bbf769f1c03b766b8e9c7c9a75/src/statsfiles.jl#L46-L82" class="documenter-source">source</a><br></p>
+<p><a target="_blank" href="https://github.com/TidierOrg/TidierFiles.jl/blob/9ae1770b5773c006e1c812dadd54af45e1eaefbd/src/statsfiles.jl#L46-L82" class="documenter-source">source</a><br></p>
 <p><a id="TidierFiles.read_table-Tuple{Any}" href="#TidierFiles.read_table-Tuple{Any}">#</a>
 <strong><code>TidierFiles.read_table</code></strong> — <em>Method</em>.</p>
 <div class="highlight"><pre><span></span><code><span class="n">read_table</span><span class="p">(</span><span class="n">file</span><span class="p">;</span><span class="w"> </span><span class="n">col_names</span><span class="o">=</span><span class="nb">true</span><span class="p">,</span><span class="w"> </span><span class="n">skip</span><span class="o">=</span><span class="mi">0</span><span class="p">,</span><span class="w"> </span><span class="n">n_max</span><span class="o">=</span><span class="nb">Inf</span><span class="p">,</span><span class="w"> </span><span class="n">comment</span><span class="o">=</span><span class="nb">nothing</span><span class="p">,</span><span class="w"> </span><span class="n">col_select</span><span class="p">,</span><span class="w"> </span><span class="n">missingstring</span><span class="o">=</span><span class="s">""</span><span class="p">,</span><span class="w"> </span><span class="n">kwargs</span><span class="o">...</span><span class="p">)</span>
@@ -649,7 +649,7 @@ <h2 id="reference-exported-functions">Reference - Exported functions<a class="he
 <span class="go">   2 │ David</span>
 <span class="go">   3 │ Eva</span>
 </code></pre></div>
-<p><a target="_blank" href="https://github.com/TidierOrg/TidierFiles.jl/blob/aa1c800b94c409bbf769f1c03b766b8e9c7c9a75/src/TidierFiles.jl#L206-L236" class="documenter-source">source</a><br></p>
+<p><a target="_blank" href="https://github.com/TidierOrg/TidierFiles.jl/blob/9ae1770b5773c006e1c812dadd54af45e1eaefbd/src/TidierFiles.jl#L206-L236" class="documenter-source">source</a><br></p>
 <p><a id="TidierFiles.read_tsv-Tuple{Any}" href="#TidierFiles.read_tsv-Tuple{Any}">#</a>
 <strong><code>TidierFiles.read_tsv</code></strong> — <em>Method</em>.</p>
 <div class="highlight"><pre><span></span><code><span class="n">read_tsv</span><span class="p">(</span><span class="n">file</span><span class="p">;</span><span class="w"> </span><span class="n">delim</span><span class="o">=</span><span class="err">'</span><span class="w">  </span><span class="err">'</span><span class="p">,</span><span class="n">col_names</span><span class="o">=</span><span class="nb">true</span><span class="p">,</span><span class="w"> </span><span class="n">skip</span><span class="o">=</span><span class="mi">0</span><span class="p">,</span><span class="w"> </span><span class="n">n_max</span><span class="o">=</span><span class="nb">Inf</span><span class="p">,</span><span class="w"> </span>
@@ -672,7 +672,7 @@ <h2 id="reference-exported-functions">Reference - Exported functions<a class="he
 <span class="go">   2 │     4  David       85</span>
 <span class="go">   3 │     5  Eva         95</span>
 </code></pre></div>
-<p><a target="_blank" href="https://github.com/TidierOrg/TidierFiles.jl/blob/aa1c800b94c409bbf769f1c03b766b8e9c7c9a75/src/TidierFiles.jl#L148-L181" class="documenter-source">source</a><br></p>
+<p><a target="_blank" href="https://github.com/TidierOrg/TidierFiles.jl/blob/9ae1770b5773c006e1c812dadd54af45e1eaefbd/src/TidierFiles.jl#L148-L181" class="documenter-source">source</a><br></p>
 <p><a id="TidierFiles.read_xlsx-Tuple{Any}" href="#TidierFiles.read_xlsx-Tuple{Any}">#</a>
 <strong><code>TidierFiles.read_xlsx</code></strong> — <em>Method</em>.</p>
 <div class="highlight"><pre><span></span><code><span class="n">read_xlsx</span><span class="p">(</span><span class="n">path</span><span class="p">;</span><span class="w"> </span><span class="n">sheet</span><span class="p">,</span><span class="w"> </span><span class="n">range</span><span class="p">,</span><span class="w"> </span><span class="n">col_names</span><span class="p">,</span><span class="w"> </span><span class="n">col_types</span><span class="p">,</span><span class="w"> </span><span class="n">missingstring</span><span class="p">,</span><span class="w"> </span><span class="n">trim_ws</span><span class="p">,</span><span class="w"> </span><span class="n">skip</span><span class="p">,</span><span class="w"> </span><span class="n">n_max</span><span class="p">,</span><span class="w"> </span><span class="n">guess_max</span><span class="p">)</span>
@@ -698,7 +698,7 @@ <h2 id="reference-exported-functions">Reference - Exported functions<a class="he
 <span class="go">   2 │ 3         File reading/writing     30.4</span>
 <span class="go">   3 │ 4         even smoother            40.5</span>
 </code></pre></div>
-<p><a target="_blank" href="https://github.com/TidierOrg/TidierFiles.jl/blob/aa1c800b94c409bbf769f1c03b766b8e9c7c9a75/src/xlfiles.jl#L35-L70" class="documenter-source">source</a><br></p>
+<p><a target="_blank" href="https://github.com/TidierOrg/TidierFiles.jl/blob/9ae1770b5773c006e1c812dadd54af45e1eaefbd/src/xlfiles.jl#L35-L70" class="documenter-source">source</a><br></p>
 <p><a id="TidierFiles.write_csv-Tuple{DataFrame, String}" href="#TidierFiles.write_csv-Tuple{DataFrame,%20String}">#</a>
 <strong><code>TidierFiles.write_csv</code></strong> — <em>Method</em>.</p>
 <div class="highlight"><pre><span></span><code><span class="n">write_csv</span><span class="p">(</span><span class="n">DataFrame</span><span class="p">,</span><span class="w"> </span><span class="n">filepath</span><span class="p">;</span><span class="w"> </span><span class="n">na</span><span class="w"> </span><span class="o">=</span><span class="w"> </span><span class="s">""</span><span class="p">,</span><span class="w"> </span><span class="n">append</span><span class="w"> </span><span class="o">=</span><span class="w"> </span><span class="nb">false</span><span class="p">,</span><span class="w"> </span><span class="n">col_names</span><span class="w"> </span><span class="o">=</span><span class="w"> </span><span class="nb">true</span><span class="p">,</span><span class="w"> </span><span class="n">missingstring</span><span class="p">,</span><span class="w"> </span><span class="n">eol</span><span class="w"> </span><span class="o">=</span><span class="w"> </span><span class="s">"</span>
@@ -722,7 +722,7 @@ <h2 id="reference-exported-functions">Reference - Exported functions<a class="he
 
 <span class="gp">julia&gt;</span><span class="w"> </span><span class="n">write_csv</span><span class="p">(</span><span class="n">df</span><span class="p">,</span><span class="w"> </span><span class="s">"csvtest.csv"</span><span class="p">);</span>
 </code></pre></div>
-<p><a target="_blank" href="https://github.com/TidierOrg/TidierFiles.jl/blob/aa1c800b94c409bbf769f1c03b766b8e9c7c9a75/src/TidierFiles.jl#L265-L286" class="documenter-source">source</a><br></p>
+<p><a target="_blank" href="https://github.com/TidierOrg/TidierFiles.jl/blob/9ae1770b5773c006e1c812dadd54af45e1eaefbd/src/TidierFiles.jl#L265-L286" class="documenter-source">source</a><br></p>
 <p><a id="TidierFiles.write_dta-Tuple{DataFrame, String}" href="#TidierFiles.write_dta-Tuple{DataFrame,%20String}">#</a>
 <strong><code>TidierFiles.write_dta</code></strong> — <em>Method</em>.</p>
 <div class="highlight"><pre><span></span><code><span class="n">write_dta</span><span class="p">(</span><span class="n">df</span><span class="p">,</span><span class="w"> </span><span class="n">path</span><span class="p">)</span>
@@ -740,7 +740,7 @@ <h2 id="reference-exported-functions">Reference - Exported functions<a class="he
 <span class="go">   1 │    sav      10.1</span>
 <span class="go">   2 │    por      10.2</span>
 </code></pre></div>
-<p><a target="_blank" href="https://github.com/TidierOrg/TidierFiles.jl/blob/aa1c800b94c409bbf769f1c03b766b8e9c7c9a75/src/statsfiles.jl#L150-L170" class="documenter-source">source</a><br></p>
+<p><a target="_blank" href="https://github.com/TidierOrg/TidierFiles.jl/blob/9ae1770b5773c006e1c812dadd54af45e1eaefbd/src/statsfiles.jl#L150-L170" class="documenter-source">source</a><br></p>
 <p><a id="TidierFiles.write_sas-Tuple{DataFrame, String}" href="#TidierFiles.write_sas-Tuple{DataFrame,%20String}">#</a>
 <strong><code>TidierFiles.write_sas</code></strong> — <em>Method</em>.</p>
 <div class="highlight"><pre><span></span><code><span class="n">write_sas</span><span class="p">(</span><span class="n">df</span><span class="p">,</span><span class="w"> </span><span class="n">path</span><span class="p">)</span>
@@ -766,7 +766,7 @@ <h2 id="reference-exported-functions">Reference - Exported functions<a class="he
 <span class="go">   1 │    sav      10.1</span>
 <span class="go">   2 │    por      10.2</span>
 </code></pre></div>
-<p><a target="_blank" href="https://github.com/TidierOrg/TidierFiles.jl/blob/aa1c800b94c409bbf769f1c03b766b8e9c7c9a75/src/statsfiles.jl#L136-L164" class="documenter-source">source</a><br></p>
+<p><a target="_blank" href="https://github.com/TidierOrg/TidierFiles.jl/blob/9ae1770b5773c006e1c812dadd54af45e1eaefbd/src/statsfiles.jl#L136-L164" class="documenter-source">source</a><br></p>
 <p><a id="TidierFiles.write_sav-Tuple{DataFrame, String}" href="#TidierFiles.write_sav-Tuple{DataFrame,%20String}">#</a>
 <strong><code>TidierFiles.write_sav</code></strong> — <em>Method</em>.</p>
 <div class="highlight"><pre><span></span><code><span class="n">write_sav</span><span class="p">(</span><span class="n">df</span><span class="p">,</span><span class="w"> </span><span class="n">path</span><span class="p">)</span>
@@ -792,7 +792,7 @@ <h2 id="reference-exported-functions">Reference - Exported functions<a class="he
 <span class="go">   1 │    sav      10.1</span>
 <span class="go">   2 │    por      10.2</span>
 </code></pre></div>
-<p><a target="_blank" href="https://github.com/TidierOrg/TidierFiles.jl/blob/aa1c800b94c409bbf769f1c03b766b8e9c7c9a75/src/statsfiles.jl#L143-L171" class="documenter-source">source</a><br></p>
+<p><a target="_blank" href="https://github.com/TidierOrg/TidierFiles.jl/blob/9ae1770b5773c006e1c812dadd54af45e1eaefbd/src/statsfiles.jl#L143-L171" class="documenter-source">source</a><br></p>
 <p><a id="TidierFiles.write_table-Tuple{DataFrame, String}" href="#TidierFiles.write_table-Tuple{DataFrame,%20String}">#</a>
 <strong><code>TidierFiles.write_table</code></strong> — <em>Method</em>.</p>
 <div class="highlight"><pre><span></span><code><span class="n">write_table</span><span class="p">(</span><span class="n">x</span><span class="p">,</span><span class="w"> </span><span class="n">file</span><span class="p">;</span><span class="w"> </span><span class="n">delim</span><span class="w"> </span><span class="o">=</span><span class="w"> </span><span class="err">'</span><span class="w">  </span><span class="err">'</span><span class="p">,</span><span class="w"> </span><span class="n">na</span><span class="p">,</span><span class="w"> </span><span class="n">append</span><span class="p">,</span><span class="w"> </span><span class="n">col_names</span><span class="p">,</span><span class="w"> </span><span class="n">eol</span><span class="p">,</span><span class="w"> </span><span class="n">num_threads</span><span class="p">)</span>
@@ -805,7 +805,7 @@ <h2 id="reference-exported-functions">Reference - Exported functions<a class="he
 
 <span class="gp">julia&gt;</span><span class="w"> </span><span class="n">write_table</span><span class="p">(</span><span class="n">df</span><span class="p">,</span><span class="w"> </span><span class="s">"tabletest.txt"</span><span class="p">);</span>
 </code></pre></div>
-<p><a target="_blank" href="https://github.com/TidierOrg/TidierFiles.jl/blob/aa1c800b94c409bbf769f1c03b766b8e9c7c9a75/src/TidierFiles.jl#L312-L333" class="documenter-source">source</a><br></p>
+<p><a target="_blank" href="https://github.com/TidierOrg/TidierFiles.jl/blob/9ae1770b5773c006e1c812dadd54af45e1eaefbd/src/TidierFiles.jl#L312-L333" class="documenter-source">source</a><br></p>
 <p><a id="TidierFiles.write_tsv-Tuple{DataFrame, String}" href="#TidierFiles.write_tsv-Tuple{DataFrame,%20String}">#</a>
 <strong><code>TidierFiles.write_tsv</code></strong> — <em>Method</em>.</p>
 <div class="highlight"><pre><span></span><code><span class="n">write_tsv</span><span class="p">(</span><span class="n">DataFrame</span><span class="p">,</span><span class="w"> </span><span class="n">filepath</span><span class="p">;</span><span class="w"> </span><span class="n">na</span><span class="w"> </span><span class="o">=</span><span class="w"> </span><span class="s">""</span><span class="p">,</span><span class="w"> </span><span class="n">append</span><span class="w"> </span><span class="o">=</span><span class="w"> </span><span class="nb">false</span><span class="p">,</span><span class="w"> </span><span class="n">col_names</span><span class="w"> </span><span class="o">=</span><span class="w"> </span><span class="nb">true</span><span class="p">,</span><span class="w"> </span><span class="n">missingstring</span><span class="p">,</span><span class="w"> </span><span class="n">eol</span><span class="w"> </span><span class="o">=</span><span class="w"> </span><span class="s">"</span>
@@ -829,7 +829,7 @@ <h2 id="reference-exported-functions">Reference - Exported functions<a class="he
 
 <span class="gp">julia&gt;</span><span class="w"> </span><span class="n">write_tsv</span><span class="p">(</span><span class="n">df</span><span class="p">,</span><span class="w"> </span><span class="s">"tsvtest.tsv"</span><span class="p">);</span>
 </code></pre></div>
-<p><a target="_blank" href="https://github.com/TidierOrg/TidierFiles.jl/blob/aa1c800b94c409bbf769f1c03b766b8e9c7c9a75/src/TidierFiles.jl#L288-L309" class="documenter-source">source</a><br></p>
+<p><a target="_blank" href="https://github.com/TidierOrg/TidierFiles.jl/blob/9ae1770b5773c006e1c812dadd54af45e1eaefbd/src/TidierFiles.jl#L288-L309" class="documenter-source">source</a><br></p>
 <p><a id="TidierFiles.write_xlsx-Tuple{Any}" href="#TidierFiles.write_xlsx-Tuple{Any}">#</a>
 <strong><code>TidierFiles.write_xlsx</code></strong> — <em>Method</em>.</p>
 <div class="highlight"><pre><span></span><code><span class="n">write_xlsx</span><span class="p">(</span><span class="n">x</span><span class="p">;</span><span class="w"> </span><span class="n">path</span><span class="p">,</span><span class="w"> </span><span class="n">overwrite</span><span class="p">)</span>
@@ -845,7 +845,7 @@ <h1 id="arguments-x-the-data-to-write-can-be-a-single-pairstring-dataframe-for-w
 
 <span class="gp">julia&gt;</span><span class="w"> </span><span class="n">write_xlsx</span><span class="p">((</span><span class="s">"REPORT_A"</span><span class="w"> </span><span class="o">=&gt;</span><span class="w"> </span><span class="n">df</span><span class="p">,</span><span class="w"> </span><span class="s">"REPORT_B"</span><span class="w"> </span><span class="o">=&gt;</span><span class="w"> </span><span class="n">df2</span><span class="p">);</span><span class="w"> </span><span class="n">path</span><span class="o">=</span><span class="s">"xlsxtest.xlsx"</span><span class="p">,</span><span class="w"> </span><span class="n">overwrite</span><span class="w"> </span><span class="o">=</span><span class="w"> </span><span class="nb">true</span><span class="p">);</span>
 </code></pre></div>
-<p><a target="_blank" href="https://github.com/TidierOrg/TidierFiles.jl/blob/aa1c800b94c409bbf769f1c03b766b8e9c7c9a75/src/xlfiles.jl#L123-L142" class="documenter-source">source</a><br></p>
+<p><a target="_blank" href="https://github.com/TidierOrg/TidierFiles.jl/blob/9ae1770b5773c006e1c812dadd54af45e1eaefbd/src/xlfiles.jl#L123-L142" class="documenter-source">source</a><br></p>
 <p><a id="Reference-Internal-functions"></a></p>
 <p><a id="Reference-Internal-functions-1"></a></p>
 <h2 id="reference-internal-functions">Reference - Internal functions<a class="headerlink" href="#reference-internal-functions" title="Permanent link">¤</a></h2></div>
diff --git a/previews/PR4/search/search_index.json b/previews/PR4/search/search_index.json
index 66a2434..afbe279 100644
--- a/previews/PR4/search/search_index.json
+++ b/previews/PR4/search/search_index.json
@@ -1 +1 @@
-{"config":{"lang":["en"],"separator":"[\\s\\-]+","pipeline":["stopWordFilter"]},"docs":[{"location":"","title":"Home","text":""},{"location":"#what-is-tidierfilesjl","title":"What is TidierFiles.jl?","text":"<p>TidierFiles.jl is a 100% Julia implementation of the readr and haven R packages. Powered by the CSV.jl, XLSX.jl and ReadStatTables.jl packages, TidierFiles.jl  seeks to harmonize file reading/writing by unifying the arguments across multiple  file types. </p> <p>TidierFiles.jl currently supports </p> <p>Example</p> <ul> <li><code>read_csv</code> and <code>write_csv</code></li> <li><code>read_tsv</code> and <code>write_tsv</code></li> <li><code>read_xlsx</code> and <code>write_xlsx</code></li> <li><code>read_delim</code> and <code>write_delim</code></li> <li><code>read_table</code> and <code>write_table</code></li> <li><code>read_fwf</code> and <code>fwf_empty</code></li> <li><code>read_sav</code> and <code>write_sav</code> (.sav and .por)</li> <li><code>read_sas</code> and <code>write_sas</code> (.sas7bdat and .xpt)</li> <li><code>read_dta</code> and <code>write_dta</code> (.dta) </li> </ul> <p>Read functions include the following arguments and support HTTP reading:</p> <ul> <li><code>path</code></li> <li><code>missingstring</code></li> <li><code>col_names</code></li> <li><code>col_select</code></li> <li><code>num_threads</code></li> <li><code>skip</code></li> <li><code>n_max</code></li> <li><code>delim</code> (where applies)</li> </ul> <pre><code>using TidierFiles\n\nread_csv(\"https://raw.githubusercontent.com/TidierOrg/TidierFiles.jl/main/testing_files/csvtest.csv\", skip = 2, n_max = 3, col_select = [\"ID\", \"Score\"], missingstring = [\"4\"])\n</code></pre> <pre><code>3\u00d72 DataFrame\n Row \u2502 ID       Score \n     \u2502 Int64?   Int64 \n\u2500\u2500\u2500\u2500\u2500\u253c\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\n   1 \u2502       3     77\n   2 \u2502 missing     85\n   3 \u2502       5     95\n</code></pre>"},{"location":"reference/","title":"Reference","text":""},{"location":"reference/#index","title":"Index","text":"<ul> <li><code>TidierFiles.fwf_empty</code></li> <li><code>TidierFiles.read_csv</code></li> <li><code>TidierFiles.read_delim</code></li> <li><code>TidierFiles.read_dta</code></li> <li><code>TidierFiles.read_fwf</code></li> <li><code>TidierFiles.read_sas</code></li> <li><code>TidierFiles.read_sav</code></li> <li><code>TidierFiles.read_table</code></li> <li><code>TidierFiles.read_tsv</code></li> <li><code>TidierFiles.read_xlsx</code></li> <li><code>TidierFiles.write_csv</code></li> <li><code>TidierFiles.write_dta</code></li> <li><code>TidierFiles.write_sas</code></li> <li><code>TidierFiles.write_sav</code></li> <li><code>TidierFiles.write_table</code></li> <li><code>TidierFiles.write_tsv</code></li> <li><code>TidierFiles.write_xlsx</code></li> </ul>"},{"location":"reference/#reference-exported-functions","title":"Reference - Exported functions","text":"<p># <code>TidierFiles.fwf_empty</code> \u2014 Method.</p> <pre><code>fwf_empty(filepath::String; num_lines::Int=4, col_names=nothing)\n</code></pre> <p>Analyze a fixed-width format (FWF) file to automatically determine column widths and provide column names.</p> <p>Arguments</p> <ul> <li><code>filepath</code>::String: Path to the FWF file to analyze.</li> </ul> <p>num_lines::Int=4: Number of lines to sample from the beginning of the file for analysis. Default is 4.</p> <ul> <li><code>col_names</code>: Optional; a vector of strings specifying column names. If not provided, column names are generated as Column1, Column2, etc.</li> </ul> <p>Returns</p> <ul> <li>A tuple containing two elements:</li> <li>A vector of integers representing the detected column widths.</li> <li>A vector of strings representing the column names.</li> </ul> <p>Examples</p> <pre><code>julia&gt; fwf_data = \n       \"John Smith   35    12345  Software Engineer   120,000 \\nJane Doe     29     2345  Marketing Manager   95,000  \\nAlice Jones  42   123456  CEO                 250,000 \\nBob Brown    31    12345  Product Manager     110,000 \\nCharlie Day  28      345  Sales Associate     70,000  \\nDiane Poe    35    23456  Data Scientist      130,000 \\nEve Stone    40   123456  Chief Financial Off 200,000 \\nFrank Moore  33     1234  Graphic Designer    80,000  \\nGrace Lee    27   123456  Software Developer  115,000 \\nHank Zuse    45    12345  System Analyst      120,000 \";\n\njulia&gt; open(\"fwftest.txt\", \"w\") do file\n         write(file, fwf_data)\n       end;\n\njulia&gt; path = \"fwftest.txt\";\n\njulia&gt; fwf_empty(path)\n([13, 5, 8, 20, 8], [\"Column_1\", \"Column_2\", \"Column_3\", \"Column_4\", \"Column_5\"])\n\njulia&gt; fwf_empty(path, num_lines=4, col_names = [\"Name\", \"Age\", \"ID\", \"Position\", \"Salary\"])\n([13, 5, 8, 20, 8], [\"Name\", \"Age\", \"ID\", \"Position\", \"Salary\"])\n</code></pre> <p>source</p> <p># <code>TidierFiles.read_csv</code> \u2014 Method.</p> <pre><code>read_csv(file; delim=',',col_names=true, skip=0, n_max=Inf, \n    comment=nothing, missingstring=\"\", col_select, escape_double=true, col_types=nothing, num_threads = 1)\n</code></pre> <p>Reads a CSV file or URL into a DataFrame, with options to specify delimiter, column names, and other CSV parsing options.</p> <p>Arguments</p> <p><code>file</code>: Path to the CSV file or a URL to a CSV file. <code>delim</code>: The character delimiting fields in the file. Default is ','. <code>col_names</code>: Indicates if the first row of the CSV is used as column names. Can be true, false, or an array of strings. Default is true. <code>skip</code>: Number of initial lines to skip before reading data. Default is 0. <code>n_max</code>: Maximum number of rows to read. Default is Inf (read all rows). -<code>col_select</code>: Optional vector of symbols or strings to select which columns to load. <code>comment</code>: Character that starts a comment line. Lines beginning with this character are ignored. Default is nothing (no comment lines). <code>missingstring</code>: String that represents missing values in the CSV. Default is \"\", can be set to a vector of multiple items. <code>escape_double</code>: Indicates whether to interpret two consecutive quote characters as a single quote in the data. Default is true. <code>num_threads</code>: specifies the number of concurrent tasks or threads to use for processing, allowing for parallel execution. Defaults to 1</p> <p>Examples</p> <pre><code>julia&gt; df = DataFrame(ID = 1:5, Name = [\"Alice\", \"Bob\", \"Charlie\", \"David\", \"Eva\"], Score = [88, 92, 77, 85, 95]);\n\njulia&gt; write_csv(df, \"csvtest.csv\");\n\njulia&gt; read_csv(\"csvtest.csv\", skip = 2, n_max = 3, missingstring = [\"95\", \"Charlie\"])\n3\u00d73 DataFrame\n Row \u2502 ID     Name     Score   \n     \u2502 Int64  String7  Int64?  \n\u2500\u2500\u2500\u2500\u2500\u253c\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\n   1 \u2502     3  missing       77\n   2 \u2502     4  David         85\n   3 \u2502     5  Eva      missing \n</code></pre> <p>source</p> <p># <code>TidierFiles.read_delim</code> \u2014 Method.</p> <pre><code>read_delim(file; delim='    ',col_names=true, skip=0, n_max=Inf, \n    comment=nothing, missingstring=\"\", col_select, escape_double=true, col_types=nothing)\n</code></pre> <p>Reads a delimited file or URL into a DataFrame, with options to specify delimiter, column names, and other CSV parsing options.</p> <p>Arguments</p> <p><code>file</code>: Path to the CSV file or a URL to a CSV file. <code>delim</code>: The character delimiting fields in the file. Default is ','. <code>col_names</code>: Indicates if the first row of the CSV is used as column names. Can be true, false, or an array of strings. Default is true. <code>skip</code>: Number of initial lines to skip before reading data. Default is 0. <code>n_max</code>: Maximum number of rows to read. Default is Inf (read all rows). -<code>col_select</code>: Optional vector of symbols or strings to select which columns to load. <code>comment</code>: Character that starts a comment line. Lines beginning with this character are ignored. Default is nothing (no comment lines). <code>missingstring</code>: String that represents missing values in the CSV. Default is \"\", can be set to a vector of multiple items. <code>escape_double</code>: Indicates whether to interpret two consecutive quote characters as a single quote in the data. Default is true. <code>col_types</code>: An optional specification of column types, can be a single type applied to all columns, or a collection of types with one for each column. Default is nothing (types are inferred). <code>num_threads</code>: specifies the number of concurrent tasks or threads to use for processing, allowing for parallel execution. Default is the number of available threads.</p> <p>Examples</p> <pre><code>julia&gt; df = DataFrame(ID = 1:5, Name = [\"Alice\", \"Bob\", \"Charlie\", \"David\", \"Eva\"], Score = [88, 92, 77, 85, 95]);\n\njulia&gt; write_csv(df, \"csvtest.csv\");\n\njulia&gt; read_delim(\"csvtest.csv\", delim = \",\", col_names = false, num_threads = 4) # col_names are false here for the purpose of demonstration\n6\u00d73 DataFrame\n Row \u2502 Column1  Column2  Column3 \n     \u2502 String3  String7  String7 \n\u2500\u2500\u2500\u2500\u2500\u253c\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\n   1 \u2502 ID       Name     Score\n   2 \u2502 1        Alice    88\n   3 \u2502 2        Bob      92\n   4 \u2502 3        Charlie  77\n   5 \u2502 4        David    85\n   6 \u2502 5        Eva      95\n</code></pre> <p>source</p> <p># <code>TidierFiles.read_dta</code> \u2014 Method.</p> <pre><code>function read_dta(data_file;  encoding=nothing, col_select=nothing, skip=0, n_max=Inf)\n</code></pre> <p>Read data from a Stata (.dta) file into a DataFrame, supporting both local and remote sources.</p> <p>Arguments</p> <p>-<code>filepath</code>: The path to the .dta file or a URL pointing to such a file. If a URL is provided, the file will be downloaded and then read. <code>encoding</code>: Optional; specifies the encoding of the input file. If not provided, defaults to the package's or function's default. <code>col_select</code>: Optional; allows specifying a subset of columns to read. This can be a vector of column names or indices. If nothing, all columns are read. skip=0: Number of rows at the beginning of the file to skip before reading. n*max=Inf: Maximum number of rows to read from the file, after skipping. If Inf, read all available rows. <code>num*threads</code>: specifies the number of concurrent tasks or threads to use for processing, allowing for parallel execution. Defaults to 1</p> <p>Examples</p> <pre><code>julia&gt; df = DataFrame(AA=[\"sav\", \"por\"], AB=[10.1, 10.2]);\n\njulia&gt; write_dta(df, \"test.dta\");\n\njulia&gt; read_dta(\"test.dta\")\n2\u00d72 DataFrame\n Row \u2502 AA       AB      \n     \u2502 String3  Float64 \n\u2500\u2500\u2500\u2500\u2500\u253c\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\n   1 \u2502 sav         10.1\n   2 \u2502 por         10.2\n</code></pre> <p>source</p> <p># <code>TidierFiles.read_fwf</code> \u2014 Method.</p> <pre><code>read_fwf(filepath::String; num_lines::Int=4, col_names=nothing)\n</code></pre> <p>Read fixed-width format (FWF) files into a DataFrame.</p> <p>Arguments</p> <ul> <li><code>filepath</code>::String: Path to the FWF file to read.</li> <li><code>widths_colnames</code>::Tuple{Vector{Int}, Union{Nothing, Vector{String}}}: A tuple containing two elements:       - A vector of integers specifying the widths of each field.       - Optionally, a vector of strings specifying column names. If nothing, column names are generated as Column1, Column2, etc.</li> <li><code>skip_to</code>=0: Number of lines at the beginning of the file to skip before reading data.</li> <li><code>n_max</code>=nothing: Maximum number of lines to read from the file. If nothing, read all lines.</li> </ul> <p>Examples</p> <pre><code>julia&gt; fwf_data = \n       \"John Smith   35    12345  Software Engineer   120,000 \\nJane Doe     29     2345  Marketing Manager   95,000  \\nAlice Jones  42   123456  CEO                 250,000 \\nBob Brown    31    12345  Product Manager     110,000 \\nCharlie Day  28      345  Sales Associate     70,000  \\nDiane Poe    35    23456  Data Scientist      130,000 \\nEve Stone    40   123456  Chief Financial Off 200,000 \\nFrank Moore  33     1234  Graphic Designer    80,000  \\nGrace Lee    27   123456  Software Developer  115,000 \\nHank Zuse    45    12345  System Analyst      120,000 \";\n\njulia&gt; open(\"fwftest.txt\", \"w\") do file\n         write(file, fwf_data)\n       end;\n\njulia&gt; path = \"fwftest.txt\";\n\njulia&gt; read_fwf(path, fwf_empty(path, num_lines=4, col_names = [\"Name\", \"Age\", \"ID\", \"Position\", \"Salary\"]), skip_to=3, n_max=3)\n3\u00d75 DataFrame\n Row \u2502 Name         Age     ID      Position         Salary  \n     \u2502 String       String  String  String           String  \n\u2500\u2500\u2500\u2500\u2500\u253c\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\n   1 \u2502 Bob Brown    31      12345   Product Manager  110,000\n   2 \u2502 Charlie Day  28      345     Sales Associate  70,000\n   3 \u2502 Diane Poe    35      23456   Data Scientist   130,000\n</code></pre> <p>source</p> <p># <code>TidierFiles.read_sas</code> \u2014 Method.</p> <pre><code>function read_sas(data_file;  encoding=nothing, col_select=nothing, skip=0, n_max=Inf, num_threads)\n</code></pre> <p>Read data from a SAS (.sas7bdat and .xpt) file into a DataFrame, supporting both local and remote sources.</p> <p>Arguments</p> <p>-<code>filepath</code>: The path to the .dta file or a URL pointing to such a file. If a URL is provided, the file will be downloaded and then read. <code>encoding</code>: Optional; specifies the encoding of the input file. If not provided, defaults to the package's or function's default. <code>col_select</code>: Optional; allows specifying a subset of columns to read. This can be a vector of column names or indices. If nothing, all columns are read. skip=0: Number of rows at the beginning of the file to skip before reading. n*max=Inf: Maximum number of rows to read from the file, after skipping. If Inf, read all available rows. <code>num*threads</code>: specifies the number of concurrent tasks or threads to use for processing, allowing for parallel execution. Defaults to 1</p> <p>Examples</p> <p>```jldoctest julia&gt; df = DataFrame(AA=[\"sav\", \"por\"], AB=[10.1, 10.2]);</p> <p>julia&gt; write_sas(df, \"test.sas7bdat\");</p> <p>julia&gt; read_sas(\"test.sas7bdat\") 2\u00d72 DataFrame  Row \u2502 AA       AB            \u2502 String3  Float64  \u2500\u2500\u2500\u2500\u2500\u253c\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500    1 \u2502 sav         10.1    2 \u2502 por         10.2</p> <p>julia&gt; write_sas(df, \"test.xpt\");</p> <p>julia&gt; read_sas(\"test.xpt\") 2\u00d72 DataFrame  Row \u2502 AA       AB            \u2502 String3  Float64  \u2500\u2500\u2500\u2500\u2500\u253c\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500    1 \u2502 sav         10.1    2 \u2502 por         10.2</p> <p>source</p> <p># <code>TidierFiles.read_sav</code> \u2014 Method.</p> <pre><code>function read_sav(data_file;  encoding=nothing, col_select=nothing, skip=0, n_max=Inf)\n</code></pre> <p>Read data from a SPSS (.sav and .por) file into a DataFrame, supporting both local and remote sources.</p> <p>Arguments</p> <p>-<code>filepath</code>: The path to the .sav or .por file or a URL pointing to such a file. If a URL is provided, the file will be downloaded and then read. <code>encoding</code>: Optional; specifies the encoding of the input file. If not provided, defaults to the package's or function's default. <code>col_select</code>: Optional; allows specifying a subset of columns to read. This can be a vector of column names or indices. If nothing, all columns are read. skip=0: Number of rows at the beginning of the file to skip before reading. n*max=Inf: Maximum number of rows to read from the file, after skipping. If Inf, read all available rows. <code>num*threads</code>: specifies the number of concurrent tasks or threads to use for processing, allowing for parallel execution. Defaults to 1</p> <p>Examples</p> <pre><code>julia&gt; df = DataFrame(AA=[\"sav\", \"por\"], AB=[10.1, 10.2]);\n\njulia&gt; write_sav(df, \"test.sav\");\n\njulia&gt; read_sav(\"test.sav\")\n2\u00d72 DataFrame\n Row \u2502 AA      AB      \n     \u2502 String  Float64 \n\u2500\u2500\u2500\u2500\u2500\u253c\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\n   1 \u2502 sav        10.1\n   2 \u2502 por        10.2\n\njulia&gt; write_sav(df, \"test.por\");\n\njulia&gt; read_sav(\"test.por\")\n2\u00d72 DataFrame\n Row \u2502 AA      AB      \n     \u2502 String  Float64 \n\u2500\u2500\u2500\u2500\u2500\u253c\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\n   1 \u2502 sav        10.1\n   2 \u2502 por        10.2\n</code></pre> <p>source</p> <p># <code>TidierFiles.read_table</code> \u2014 Method.</p> <pre><code>read_table(file; col_names=true, skip=0, n_max=Inf, comment=nothing, col_select, missingstring=\"\", kwargs...)\n</code></pre> <p>Read a table from a file where columns are separated by any amount of whitespace, processing it into a DataFrame.</p> <p>Arguments</p> <p>-<code>file</code>: The path to the file to read. -<code>col_names</code>=true: Indicates whether the first non-skipped line should be treated as column names. If false, columns are named automatically. -<code>skip</code>: Number of lines at the beginning of the file to skip before processing starts. -<code>n_max</code>: The maximum number of lines to read from the file, after skipping. Inf means read all lines. -<code>col_select</code>: Optional vector of symbols or strings to select which columns to load. -<code>comment</code>: A character or string indicating the start of a comment. Lines starting with this character are ignored. -<code>missingstring</code>: The string that represents missing values in the table. -<code>kwargs</code>: Additional keyword arguments passed to CSV.File.</p> <p>Examples</p> <pre><code>julia&gt; df = DataFrame(ID = 1:5, Name = [\"Alice\", \"Bob\", \"Charlie\", \"David\", \"Eva\"], Score = [88, 92, 77, 85, 95]);\n\njulia&gt; write_table(df, \"tabletest.txt\");\n\njulia&gt; read_table(\"tabletest.txt\", skip = 2, n_max = 3, col_select = [\"Name\"])\n3\u00d71 DataFrame\n Row \u2502 Name    \n     \u2502 String7 \n\u2500\u2500\u2500\u2500\u2500\u253c\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\n   1 \u2502 Charlie\n   2 \u2502 David\n   3 \u2502 Eva\n</code></pre> <p>source</p> <p># <code>TidierFiles.read_tsv</code> \u2014 Method.</p> <pre><code>read_tsv(file; delim='  ',col_names=true, skip=0, n_max=Inf, \n    comment=nothing, missingstring=\"\", col_select, escape_double=true, col_types=nothing)\n</code></pre> <p>Reads a TSV file or URL into a DataFrame, with options to specify delimiter, column names, and other CSV parsing options.</p> <p>Arguments</p> <p><code>file</code>: Path to the TSV file or a URL to a TSV file. <code>delim</code>: The character delimiting fields in the file. Default is ','. <code>col_names</code>: Indicates if the first row of the CSV is used as column names. Can be true, false, or an array of strings. Default is true. <code>skip</code>: Number of initial lines to skip before reading data. Default is 0. <code>n_max</code>: Maximum number of rows to read. Default is Inf (read all rows). -<code>col_select</code>: Optional vector of symbols or strings to select which columns to load. <code>comment</code>: Character that starts a comment line. Lines beginning with this character are ignored. Default is nothing (no comment lines). <code>missingstring</code>: String that represents missing values in the CSV. Default is \"\", can be set to a vector of multiple items. <code>escape_double</code>: Indicates whether to interpret two consecutive quote characters as a single quote in the data. Default is true. <code>num_threads</code>: specifies the number of concurrent tasks or threads to use for processing, allowing for parallel execution. Default is the number of available threads.</p> <p>Examples</p> <pre><code>julia&gt; df = DataFrame(ID = 1:5, Name = [\"Alice\", \"Bob\", \"Charlie\", \"David\", \"Eva\"], Score = [88, 92, 77, 85, 95]);\n\njulia&gt; write_tsv(df, \"tsvtest.tsv\");\n\njulia&gt; read_tsv(\"tsvtest.tsv\", skip = 2, n_max = 3, missingstring = [\"Charlie\"])\n3\u00d73 DataFrame\n Row \u2502 ID     Name     Score \n     \u2502 Int64  String7  Int64 \n\u2500\u2500\u2500\u2500\u2500\u253c\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\n   1 \u2502     3  missing     77\n   2 \u2502     4  David       85\n   3 \u2502     5  Eva         95\n</code></pre> <p>source</p> <p># <code>TidierFiles.read_xlsx</code> \u2014 Method.</p> <pre><code>read_xlsx(path; sheet, range, col_names, col_types, missingstring, trim_ws, skip, n_max, guess_max)\n</code></pre> <p>Read data from an Excel file into a DataFrame.</p> <p>Arguments</p> <p>-<code>path</code>: The path to the Excel file to be read. -<code>sheet</code>: Specifies the sheet to be read. Can be either the name of the sheet as a string or its index as an integer. If nothing, the first sheet is read. -<code>range</code>: Specifies a specific range of cells to be read from the sheet. If nothing, the entire sheet is read. -<code>col_names</code>: Indicates whether the first row of the specified range should be treated as column names. If false, columns will be named automatically. -<code>col_types</code>: Allows specifying column types explicitly. Can be a single type applied to all columns, a list or a dictionary mapping column names or indices to types. If nothing, types will be inferred. -<code>missingstring</code>: The value or vector that represents missing values in the Excel file. -<code>trim_ws</code>: Whether to trim leading and trailing whitespace from cells in the Excel file. -<code>skip</code>: Number of rows to skip at the beginning of the sheet or range before reading data. -<code>n_max</code>: The maximum number of rows to read from the sheet or range, after skipping. Inf means read all available rows. -<code>guess_max</code>: The maximum number of rows to scan for type guessing and column names detection. Only relevant if coltypes is nothing or colnames is true. If nothing, a default heuristic is used.</p> <p>Examples</p> <pre><code>julia&gt; df = DataFrame(integers=[1, 2, 3, 4],\n       strings=[\"This\", \"Package makes\", \"File reading/writing\", \"even smoother\"],\n       floats=[10.2, 20.3, 30.4, 40.5]);\n\njulia&gt; df2 = DataFrame(AA=[\"aa\", \"bb\"], AB=[10.1, 10.2]);\n\njulia&gt; write_xlsx((\"REPORT_A\" =&gt; df, \"REPORT_B\" =&gt; df2); path=\"xlsxtest.xlsx\", overwrite = true);\n\njulia&gt; read_xlsx(\"xlsxtest.xlsx\", sheet = \"REPORT_A\", skip = 1, n_max = 4, missingstring = [2])\n3\u00d73 DataFrame\n Row \u2502 integers  strings               floats  \n     \u2502 Any       String                Float64 \n\u2500\u2500\u2500\u2500\u2500\u253c\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\n   1 \u2502 missing   Package makes            20.3\n   2 \u2502 3         File reading/writing     30.4\n   3 \u2502 4         even smoother            40.5\n</code></pre> <p>source</p> <p># <code>TidierFiles.write_csv</code> \u2014 Method.</p> <pre><code>write_csv(DataFrame, filepath; na = \"\", append = false, col_names = true, missingstring, eol = \"\n</code></pre> <p>\", num_threads = Threads.nthreads()) Write a DataFrame to a CSV (comma-separated values) file.</p> <p>Arguments</p> <ul> <li><code>x</code>: The DataFrame to write to the CSV file.</li> <li><code>file</code>: The path to the output CSV file.</li> <li><code>missingstring</code>: = \"\": The string to represent missing values in the output file. Default is an empty string.</li> <li><code>append</code>: Whether to append to the file if it already exists. Default is false.</li> <li><code>col_names</code>: = true: Whether to write column names as the first line of the file. Default is true.</li> <li><code>eol</code>: = \"</li> </ul> <p>\": The end-of-line character to use in the output file. Default is the newline character.</p> <ul> <li><code>num_threads</code> = Threads.nthreads(): The number of threads to use for writing the file. Default is the number of available threads.</li> </ul> <p>Examples</p> <pre><code>julia&gt; df = DataFrame(ID = 1:5, Name = [\"Alice\", \"Bob\", \"Charlie\", \"David\", \"Eva\"], Score = [88, 92, 77, 85, 95]);\n\njulia&gt; write_csv(df, \"csvtest.csv\");\n</code></pre> <p>source</p> <p># <code>TidierFiles.write_dta</code> \u2014 Method.</p> <pre><code>write_dta(df, path)\n</code></pre> <p>Write a DataFrame to a Stata (.dta) file.</p> <p>Arguments -<code>df</code>: The DataFrame to be written to a file. -<code>path</code>: String as path where the .dta file will be created. If a file at this path already exists, it will be overwritten.</p> <p>Examples</p> <pre><code>julia&gt; df = DataFrame(AA=[\"sav\", \"por\"], AB=[10.1, 10.2]);\n\njulia&gt; write_dta(df, \"test.dta\")\n2\u00d72 ReadStatTable:\n Row \u2502     AA        AB \n     \u2502 String  Float64? \n\u2500\u2500\u2500\u2500\u2500\u253c\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\n   1 \u2502    sav      10.1\n   2 \u2502    por      10.2\n</code></pre> <p>source</p> <p># <code>TidierFiles.write_sas</code> \u2014 Method.</p> <pre><code>write_sas(df, path)\n</code></pre> <p>Write a DataFrame to a SAS (.sas7bdat or .xpt) file.</p> <p>Arguments -<code>df</code>: The DataFrame to be written to a file. -<code>path</code>: String as path where the .dta file will be created. If a file at this path already exists, it will be overwritten.</p> <p>Examples</p> <pre><code>julia&gt; df = DataFrame(AA=[\"sav\", \"por\"], AB=[10.1, 10.2]);\n\njulia&gt; write_sas(df, \"test.sas7bdat\")\n2\u00d72 ReadStatTable:\n Row \u2502     AA        AB \n     \u2502 String  Float64? \n\u2500\u2500\u2500\u2500\u2500\u253c\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\n   1 \u2502    sav      10.1\n   2 \u2502    por      10.2\n\njulia&gt; write_sas(df, \"test.xpt\")\n2\u00d72 ReadStatTable:\n Row \u2502     AA        AB \n     \u2502 String  Float64? \n\u2500\u2500\u2500\u2500\u2500\u253c\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\n   1 \u2502    sav      10.1\n   2 \u2502    por      10.2\n</code></pre> <p>source</p> <p># <code>TidierFiles.write_sav</code> \u2014 Method.</p> <pre><code>write_sav(df, path)\n</code></pre> <p>Write a DataFrame to a SPSS (.sav or .por) file.</p> <p>Arguments -<code>df</code>: The DataFrame to be written to a file. -<code>path</code>: String as path where the .dta file will be created. If a file at this path already exists, it will be overwritten.</p> <p>Examples</p> <pre><code>julia&gt; df = DataFrame(AA=[\"sav\", \"por\"], AB=[10.1, 10.2]);\n\njulia&gt; write_sav(df, \"test.sav\")\n2\u00d72 ReadStatTable:\n Row \u2502     AA        AB \n     \u2502 String  Float64? \n\u2500\u2500\u2500\u2500\u2500\u253c\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\n   1 \u2502    sav      10.1\n   2 \u2502    por      10.2\n\njulia&gt; write_sav(df, \"test.por\")\n2\u00d72 ReadStatTable:\n Row \u2502     AA        AB \n     \u2502 String  Float64? \n\u2500\u2500\u2500\u2500\u2500\u253c\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\n   1 \u2502    sav      10.1\n   2 \u2502    por      10.2\n</code></pre> <p>source</p> <p># <code>TidierFiles.write_table</code> \u2014 Method.</p> <pre><code>write_table(x, file; delim = '  ', na, append, col_names, eol, num_threads)\n</code></pre> <p>Write a DataFrame to a file, allowing for customization of the delimiter and other options.</p> <p>Arguments</p> <p>-<code>x</code>: The DataFrame to write to a file. -<code>file</code>: The path to the file where the DataFrame will be written. -delim: Character to use as the field delimiter. The default is tab ('   '), making it a TSV (tab-separated values) file by default, but can be changed to accommodate other formats. -<code>missingstring</code>: The string to represent missing data in the output file. -<code>append</code>: Whether to append to the file if it already exists. If false, the file will be overwritten. -<code>col_names</code>: Whether to write column names as the first line of the file. If appending to an existing file with append = true, column names will not be written regardless of this parameter's value. -<code>eol</code>: The end-of-line character to use in the file. Defaults to \" \". -<code>num_threads</code>: Number of threads to use for writing the file. Uses the number of available Julia threads by default.</p> <p>Examples</p> <pre><code>julia&gt; df = DataFrame(ID = 1:5, Name = [\"Alice\", \"Bob\", \"Charlie\", \"David\", \"Eva\"], Score = [88, 92, 77, 85, 95]);\n\njulia&gt; write_table(df, \"tabletest.txt\");\n</code></pre> <p>source</p> <p># <code>TidierFiles.write_tsv</code> \u2014 Method.</p> <pre><code>write_tsv(DataFrame, filepath; na = \"\", append = false, col_names = true, missingstring, eol = \"\n</code></pre> <p>\", num_threads = Threads.nthreads()) Write a DataFrame to a TSV (tab-separated values) file.</p> <p>Arguments</p> <ul> <li><code>x</code>: The DataFrame to write to the TSV file.</li> <li><code>file</code>: The path to the output TSV file.</li> <li><code>missingstring</code>: = \"\": The string to represent missing values in the output file. Default is an empty string.</li> <li><code>append</code>: Whether to append to the file if it already exists. Default is false.</li> <li><code>col_names</code>: = true: Whether to write column names as the first line of the file. Default is true.</li> <li><code>eol</code>: = \"</li> </ul> <p>\": The end-of-line character to use in the output file. Default is the newline character.</p> <ul> <li><code>num_threads</code> = Threads.nthreads(): The number of threads to use for writing the file. Default is the number of available threads.</li> </ul> <p>Examples</p> <pre><code>julia&gt; df = DataFrame(ID = 1:5, Name = [\"Alice\", \"Bob\", \"Charlie\", \"David\", \"Eva\"], Score = [88, 92, 77, 85, 95]);\n\njulia&gt; write_tsv(df, \"tsvtest.tsv\");\n</code></pre> <p>source</p> <p># <code>TidierFiles.write_xlsx</code> \u2014 Method.</p> <pre><code>write_xlsx(x; path, overwrite)\n</code></pre> <p>Write a DataFrame, or multiple DataFrames, to an Excel file.</p>"},{"location":"reference/#arguments-x-the-data-to-write-can-be-a-single-pairstring-dataframe-for-writing-one-sheet-or-a-tuple-of-such-pairs-for-writing-multiple-sheets-the-string-in-each-pair-specifies-the-sheet-name-and-the-dataframe-is-the-data-to-write-to-that-sheet-path-the-path-to-the-excel-file-where-the-data-will-be-written-overwrite-defaults-to-false-whether-to-overwrite-an-existing-file-if-false-an-error-is-thrown-when-attempting-to-write-to-an-existing-file","title":"Arguments -<code>x</code>: The data to write. Can be a single Pair{String, DataFrame} for writing one sheet, or a Tuple of such pairs for writing multiple sheets. The String in each pair specifies the sheet name, and the DataFrame is the data to write to that sheet. -<code>path</code>: The path to the Excel file where the data will be written. -<code>overwrite</code>: Defaults to false. Whether to overwrite an existing file. If false, an error is thrown when attempting to write to an existing file.","text":"<p>Examples</p> <pre><code>julia&gt; df = DataFrame(integers=[1, 2, 3, 4],\n       strings=[\"This\", \"Package makes\", \"File reading/writing\", \"even smoother\"],\n       floats=[10.2, 20.3, 30.4, 40.5]);\n\njulia&gt; df2 = DataFrame(AA=[\"aa\", \"bb\"], AB=[10.1, 10.2]);\n\njulia&gt; write_xlsx((\"REPORT_A\" =&gt; df, \"REPORT_B\" =&gt; df2); path=\"xlsxtest.xlsx\", overwrite = true);\n</code></pre> <p>source</p> <p></p> <p></p>"},{"location":"reference/#reference-internal-functions","title":"Reference - Internal functions","text":""},{"location":"examples/generated/UserGuide/delim/","title":"Delimited Files","text":"<p>The goal of reading and writing throughout TidierFiles.jl is to use consistent syntax. This functions on this page focus on delimited files and are powered by CSV.jl.</p> <pre><code>using TidierFiles\n</code></pre> <p></p> <p></p>"},{"location":"examples/generated/UserGuide/delim/#read_csvtsvdelim","title":"read_csv/tsv/delim","text":"<pre><code>read_csv(\"https://raw.githubusercontent.com/TidierOrg/TidierFiles.jl/main/testing_files/csvtest.csv\", skip = 2, n_max = 3, col_select = [\"ID\", \"Score\"], missingstring = [\"4\"])\n\n#read_csv(file; delim=',', col_names=true, skip=0, n_max=Inf, comment=nothing, missingstring=\"\", col_select=nothing, escape_double=true, col_types=nothing, num_threads=1)\n\n#read_tsv(file; delim='\\t', col_names=true, skip=0, n_max=Inf, comment=nothing, missingstring=\"\", col_select=nothing, escape_double=true, col_types=nothing, num_threads=Threads.nthreads())\n\n#read_delim(file; delim='\\t', col_names=true, skip=0, n_max=Inf, comment=nothing, missingstring=\"\", col_select=nothing, escape_double=true, col_types=nothing, num_threads=Threads.nthreads())\n\n#These functions read a delimited file (CSV, TSV, or custom delimiter) into a DataFrame. The arguments are:\n</code></pre> 3\u00d72 DataFrame RowIDScoreInt64?Int6413772missing853595 <ul> <li><code>file</code>: Path to the file or a URL.</li> <li><code>delim</code>: Field delimiter. Default is ',' for <code>read_csv</code>, '\\t' for <code>read_tsv</code> and <code>read_delim</code>.</li> <li><code>col_names</code>: Use first row as column names. Can be <code>true</code>, <code>false</code>, or an array of strings. Default is <code>true</code>.</li> <li><code>skip</code>: Number of lines to skip before reading data. Default is 0.</li> <li><code>n_max</code>: Maximum number of rows to read. Default is <code>Inf</code> (read all rows).</li> <li><code>comment</code>: Character indicating comment lines to ignore. Default is <code>nothing</code>.</li> <li><code>missingstring</code>: String(s) representing missing values. Default is <code>\"\"</code>.</li> <li><code>col_select</code>: Optional vector of symbols or strings to select columns to load. Default is <code>nothing</code>.</li> <li><code>escape_double</code>: Interpret two consecutive quote characters as a single quote. Default is <code>true</code>.</li> <li><code>col_types</code>: Optional specification of column types. Default is <code>nothing</code> (types are inferred).</li> <li><code>num_threads</code>: Number of threads to use for parallel execution. Default is 1 for <code>read_csv</code> and the number of available threads for <code>read_tsv</code> and <code>read_delim</code>.</li> </ul> <p>The functions return a DataFrame containing the parsed data from the file.</p> <p></p> <p></p>"},{"location":"examples/generated/UserGuide/delim/#write_csv-and-write_tsv","title":"<code>write_csv</code> and # ## <code>write_tsv</code>","text":"<p>writecsv(x, file; missingstring=\"\", append=false, colnames=true, eol=\"\\n\", num_threads=Threads.nthreads())</p> <p>writetsv(x, file; missingstring=\"\", append=false, colnames=true, eol=\"\\n\", num_threads=Threads.nthreads())</p> <p>These functions write a DataFrame to a CSV or TSV file. The arguments are:</p> <ul> <li><code>x</code>: The DataFrame to write.</li> <li><code>file</code>: The path to the output file.</li> <li><code>missingstring</code>: The string to represent missing values. Default is an empty string.</li> <li><code>append</code>: Whether to append to an existing file. Default is <code>false</code>.</li> <li><code>col_names</code>: Whether to write column names as the first line. Default is <code>true</code>.</li> <li><code>eol</code>: The end-of-line character. Default is <code>\"\\n\"</code>.</li> <li><code>num_threads</code>: The number of threads to use for writing. Default is the number of available threads.</li> </ul> <p></p> <p></p>"},{"location":"examples/generated/UserGuide/delim/#read_table","title":"<code>read_table</code>","text":"<p>readtable(file; colnames=true, skip=0, nmax=Inf, comment=nothing, colselect=nothing, missingstring=\"\", num_threads)</p> <p>This function reads a table from a whitespace-delimited file into a DataFrame. The arguments are:</p> <ul> <li><code>file</code>: The path to the file to read.</li> <li><code>col_names</code>: Whether the first non-skipped line contains column names. Default is <code>true</code>.</li> <li><code>skip</code>: Number of lines to skip before processing. Default is 0.</li> <li><code>n_max</code>: Maximum number of lines to read. Default is <code>Inf</code> (read all lines).</li> <li><code>comment</code>: Character or string indicating comment lines to ignore. Default is <code>nothing</code>.</li> <li><code>col_select</code>: Optional vector of symbols or strings to select columns to load. Default is <code>nothing</code>.</li> <li><code>missingstring</code>: The string representing missing values. Default is <code>\"\"</code>.</li> <li><code>num_threads</code>: The number of threads to use for writing. Default is the number of available threads.</li> </ul> <p></p> <p></p>"},{"location":"examples/generated/UserGuide/delim/#write_table","title":"<code>write_table</code>","text":"<p>writetable(x, file; delim='\\t', missingstring=\"\", append=false, colnames=true, eol=\"\\n\", num_threads=Threads.nthreads())</p> <p>This function writes a DataFrame to a file with customizable delimiter and options. The arguments are:</p> <ul> <li><code>x</code>: The DataFrame to write.</li> <li><code>file</code>: The path to the output file.</li> <li><code>delim</code>: The field delimiter. Default is <code>'\\t'</code> (tab-separated).</li> <li><code>missingstring</code>: The string to represent missing values. Default is <code>\"\"</code>.</li> <li><code>append</code>: Whether to append to an existing file. Default is <code>false</code>.</li> <li><code>col_names</code>: Whether to write column names as the first line. Default is <code>true</code>.</li> <li><code>eol</code>: The end-of-line character. Default is <code>\"\\n\"</code>.</li> <li><code>num_threads</code>: The number of threads to use for writing. Default is the number of available threads.</li> </ul> <p>This page was generated using Literate.jl.</p>"},{"location":"examples/generated/UserGuide/stats/","title":"Stats Files","text":"<p>The functions for reading and writing stats files are made possible by ReadStatTables.jl</p> <p></p> <p></p>"},{"location":"examples/generated/UserGuide/stats/#reading-stats-files","title":"reading stats files","text":"<p>readdta(filepath; encoding=nothing, colselect=nothing, skip=0, nmax=Inf, numthreads=1) readsas(filepath; encoding=nothing, colselect=nothing, skip=0, nmax=Inf, numthreads=1) readsav(filepath; encoding=nothing, colselect=nothing, skip=0, nmax=Inf, numthreads=1)</p> <p>These functions read data from Stata (.dta), SAS (.sas7bdat and .xpt), and SPSS (.sav and .por) files into a DataFrame. The arguments are:</p> <ul> <li><code>filepath</code>: The path to the file or a URL pointing to the file. If a URL is provided, the file will be downloaded and then read.</li> <li><code>encoding</code>: Optional; specifies the encoding of the input file. Default is the package's or function's default.</li> <li><code>col_select</code>: Optional; allows specifying a subset of columns to read. Can be a vector of column names or indices. Default is <code>nothing</code> (all columns are read).</li> <li><code>skip</code>: Number of rows to skip at the beginning of the file. Default is 0.</li> <li><code>n_max</code>: Maximum number of rows to read after skipping. Default is <code>Inf</code> (read all rows).</li> <li><code>num_threads</code>: Number of concurrent tasks or threads to use for processing. Default is 1.</li> </ul> <p></p> <p></p>"},{"location":"examples/generated/UserGuide/stats/#writing-stats-files","title":"writing stats files","text":"<p>writesav(df, path) writesas(df, path) write_dta(df, path)</p> <p>These functions write a DataFrame to SPSS (.sav or .por), SAS (.sas7bdat or .xpt), and Stata (.dta) files. The arguments are:</p> <ul> <li><code>df</code>: The DataFrame to be written to a file.</li> <li><code>path</code>: The path where the file will be created. If a file at this path already exists, it will be overwritten.</li> </ul> <p>This page was generated using Literate.jl.</p>"},{"location":"examples/generated/UserGuide/xl/","title":"Excel Files","text":"<p>Reading and writing XLSX files are made possible by XLSX.jl</p> <p></p> <p></p>"},{"location":"examples/generated/UserGuide/xl/#read_xlsx","title":"<code>read_xlsx</code>","text":"<p>readxlsx(path; sheet=nothing, range=nothing, colnames=true, coltypes=nothing, missingstring=\"\", trimws=true, skip=0, nmax=Inf, guessmax=nothing)</p> <p>This function reads data from an Excel file into a DataFrame. The arguments are:</p> <ul> <li><code>path</code>: The path or URL to the Excel file to be read.</li> <li><code>sheet</code>: The sheet to be read. Can be a sheet name (string) or index (integer). Default is the first sheet.</li> <li><code>range</code>: A specific range of cells to be read from the sheet. Default is the entire sheet.</li> <li><code>col_names</code>: Whether the first row of the range contains column names. Default is <code>true</code>.</li> <li><code>col_types</code>: Explicit specification of column types. Can be a single type, a list, or a dictionary mapping column names or indices to types. Default is <code>nothing</code> (types are inferred).</li> <li><code>missingstring</code>: The string representing missing values. Default is <code>\"\"</code>.</li> <li><code>trim_ws</code>: Whether to trim leading and trailing whitespace from cells. Default is <code>true</code>.</li> <li><code>skip</code>: Number of rows to skip before reading data. Default is 0.</li> <li><code>n_max</code>: Maximum number of rows to read. Default is <code>Inf</code> (read all rows).</li> <li><code>guess_max</code>: Maximum number of rows to scan for type guessing and column names detection. Default is <code>nothing</code> (a default heuristic is used).</li> </ul> <p></p> <p></p>"},{"location":"examples/generated/UserGuide/xl/#write_xlsx","title":"<code>write_xlsx</code>","text":"<p>write_xlsx(x; path, overwrite=false)</p> <p>This function writes a DataFrame, or multiple DataFrames, to an Excel file. The arguments are:</p> <ul> <li><code>x</code>: The data to write. Can be a single <code>Pair{String, DataFrame}</code> for writing one sheet, or a <code>Tuple</code> of such pairs for writing multiple sheets. The <code>String</code> in each pair specifies the sheet name, and the <code>DataFrame</code> is the data to write to that sheet.</li> <li><code>path</code>: The path to the output Excel file.</li> <li><code>overwrite</code>: Whether to overwrite an existing file. Default is <code>false</code>.</li> </ul> <p>This page was generated using Literate.jl.</p>"}]}
\ No newline at end of file
+{"config":{"lang":["en"],"separator":"[\\s\\-]+","pipeline":["stopWordFilter"]},"docs":[{"location":"","title":"Home","text":""},{"location":"#tidierfilesjl","title":"TidierFiles.jl","text":""},{"location":"#what-is-tidierfilesjl","title":"What is TidierFiles.jl?","text":"<p>TidierFiles.jl is a 100% Julia implementation of the readr, haven, readxl, and writexl R packages.</p> <p>Powered by the CSV.jl, XLSX.jl and ReadStatTables.jl packages, TidierFiles.jl aims to bring a consistent interface to the reading and writing of tabular data, including a consistent syntax to read files locally versus from the web and consistent keyword arguments across data formats.</p> <p>Currently supported file types:</p> <ul> <li><code>read_csv</code> and <code>write_csv</code></li> <li><code>read_tsv</code> and <code>write_tsv</code></li> <li><code>read_xlsx</code> and <code>write_xlsx</code></li> <li><code>read_delim</code> and <code>write_delim</code></li> <li><code>read_table</code> and <code>write_table</code></li> <li><code>read_fwf</code> and <code>fwf_empty</code></li> <li><code>read_sav</code> and <code>write_sav</code> (.sav and .por)</li> <li><code>read_sas</code> and <code>write_sas</code> (.sas7bdat and .xpt)</li> <li><code>read_dta</code> and <code>write_dta</code> (.dta)</li> </ul> <p></p> <p></p>"},{"location":"#examples","title":"Examples","text":"<p>Here is an example of how to write and read a CSV file.</p> <pre><code>using TidierFiles\n\ndf = DataFrame(\n       integers = [1, 2, 3, 4],\n       strings = [\"This\", \"Package makes\", \"File reading/writing\", \"even smoother\"],\n       floats = [10.2, 20.3, 30.4, 40.5],\n       dates = [Date(2018,2,20), Date(2018,2,21), Date(2018,2,22), Date(2018,2,23)],\n       times = [Dates.Time(19,10), Dates.Time(19,20), Dates.Time(19,30), Dates.Time(19,40)]\n     )\n\nwrite_csv(df, \"testing.csv\" , col_names = true)\n\nread_csv(\"testing.csv\", missingstring=[\"40.5\", \"10.2\"])\n</code></pre> <pre><code>4\u00d75 DataFrame\n Row \u2502 integers  strings               floats     dates       times    \n     \u2502 Int64     String31              Float64?   Date        Time     \n\u2500\u2500\u2500\u2500\u2500\u253c\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\n   1 \u2502        1  This                  missing    2018-02-20  19:10:00\n   2 \u2502        2  Package makes              20.3  2018-02-21  19:20:00\n   3 \u2502        3  File reading/writing       30.4  2018-02-22  19:30:00\n   4 \u2502        4  even smoother         missing    2018-02-23  19:40:00:00\n</code></pre> <p>The file reading functions include the following keyword arguments:</p> <ul> <li><code>path</code></li> <li><code>missingstring</code></li> <li><code>col_names</code></li> <li><code>col_select</code></li> <li><code>num_threads</code></li> <li><code>skip</code></li> <li><code>n_max</code></li> <li><code>delim</code> (where applicable)</li> </ul> <p>The path can be a file available either locally or on the web.</p> <pre><code>read_csv(\"https://raw.githubusercontent.com/TidierOrg/TidierFiles.jl/main/testing_files/csvtest.csv\", skip = 2, n_max = 3, col_select = [\"ID\", \"Score\"], missingstring = [\"4\"])\n</code></pre> <pre><code>3\u00d72 DataFrame\n Row \u2502 ID       Score \n     \u2502 Int64?   Int64 \n\u2500\u2500\u2500\u2500\u2500\u253c\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\n   1 \u2502       3     77\n   2 \u2502 missing     85\n   3 \u2502       5     95\n</code></pre>"},{"location":"reference/","title":"Reference","text":""},{"location":"reference/#index","title":"Index","text":"<ul> <li><code>TidierFiles.fwf_empty</code></li> <li><code>TidierFiles.read_csv</code></li> <li><code>TidierFiles.read_delim</code></li> <li><code>TidierFiles.read_dta</code></li> <li><code>TidierFiles.read_fwf</code></li> <li><code>TidierFiles.read_sas</code></li> <li><code>TidierFiles.read_sav</code></li> <li><code>TidierFiles.read_table</code></li> <li><code>TidierFiles.read_tsv</code></li> <li><code>TidierFiles.read_xlsx</code></li> <li><code>TidierFiles.write_csv</code></li> <li><code>TidierFiles.write_dta</code></li> <li><code>TidierFiles.write_sas</code></li> <li><code>TidierFiles.write_sav</code></li> <li><code>TidierFiles.write_table</code></li> <li><code>TidierFiles.write_tsv</code></li> <li><code>TidierFiles.write_xlsx</code></li> </ul>"},{"location":"reference/#reference-exported-functions","title":"Reference - Exported functions","text":"<p># <code>TidierFiles.fwf_empty</code> \u2014 Method.</p> <pre><code>fwf_empty(filepath::String; num_lines::Int=4, col_names=nothing)\n</code></pre> <p>Analyze a fixed-width format (FWF) file to automatically determine column widths and provide column names.</p> <p>Arguments</p> <ul> <li><code>filepath</code>::String: Path to the FWF file to analyze.</li> </ul> <p>num_lines::Int=4: Number of lines to sample from the beginning of the file for analysis. Default is 4.</p> <ul> <li><code>col_names</code>: Optional; a vector of strings specifying column names. If not provided, column names are generated as Column1, Column2, etc.</li> </ul> <p>Returns</p> <ul> <li>A tuple containing two elements:</li> <li>A vector of integers representing the detected column widths.</li> <li>A vector of strings representing the column names.</li> </ul> <p>Examples</p> <pre><code>julia&gt; fwf_data = \n       \"John Smith   35    12345  Software Engineer   120,000 \\nJane Doe     29     2345  Marketing Manager   95,000  \\nAlice Jones  42   123456  CEO                 250,000 \\nBob Brown    31    12345  Product Manager     110,000 \\nCharlie Day  28      345  Sales Associate     70,000  \\nDiane Poe    35    23456  Data Scientist      130,000 \\nEve Stone    40   123456  Chief Financial Off 200,000 \\nFrank Moore  33     1234  Graphic Designer    80,000  \\nGrace Lee    27   123456  Software Developer  115,000 \\nHank Zuse    45    12345  System Analyst      120,000 \";\n\njulia&gt; open(\"fwftest.txt\", \"w\") do file\n         write(file, fwf_data)\n       end;\n\njulia&gt; path = \"fwftest.txt\";\n\njulia&gt; fwf_empty(path)\n([13, 5, 8, 20, 8], [\"Column_1\", \"Column_2\", \"Column_3\", \"Column_4\", \"Column_5\"])\n\njulia&gt; fwf_empty(path, num_lines=4, col_names = [\"Name\", \"Age\", \"ID\", \"Position\", \"Salary\"])\n([13, 5, 8, 20, 8], [\"Name\", \"Age\", \"ID\", \"Position\", \"Salary\"])\n</code></pre> <p>source</p> <p># <code>TidierFiles.read_csv</code> \u2014 Method.</p> <pre><code>read_csv(file; delim=',',col_names=true, skip=0, n_max=Inf, \n    comment=nothing, missingstring=\"\", col_select, escape_double=true, col_types=nothing, num_threads = 1)\n</code></pre> <p>Reads a CSV file or URL into a DataFrame, with options to specify delimiter, column names, and other CSV parsing options.</p> <p>Arguments</p> <p><code>file</code>: Path to the CSV file or a URL to a CSV file. <code>delim</code>: The character delimiting fields in the file. Default is ','. <code>col_names</code>: Indicates if the first row of the CSV is used as column names. Can be true, false, or an array of strings. Default is true. <code>skip</code>: Number of initial lines to skip before reading data. Default is 0. <code>n_max</code>: Maximum number of rows to read. Default is Inf (read all rows). -<code>col_select</code>: Optional vector of symbols or strings to select which columns to load. <code>comment</code>: Character that starts a comment line. Lines beginning with this character are ignored. Default is nothing (no comment lines). <code>missingstring</code>: String that represents missing values in the CSV. Default is \"\", can be set to a vector of multiple items. <code>escape_double</code>: Indicates whether to interpret two consecutive quote characters as a single quote in the data. Default is true. <code>num_threads</code>: specifies the number of concurrent tasks or threads to use for processing, allowing for parallel execution. Defaults to 1</p> <p>Examples</p> <pre><code>julia&gt; df = DataFrame(ID = 1:5, Name = [\"Alice\", \"Bob\", \"Charlie\", \"David\", \"Eva\"], Score = [88, 92, 77, 85, 95]);\n\njulia&gt; write_csv(df, \"csvtest.csv\");\n\njulia&gt; read_csv(\"csvtest.csv\", skip = 2, n_max = 3, missingstring = [\"95\", \"Charlie\"])\n3\u00d73 DataFrame\n Row \u2502 ID     Name     Score   \n     \u2502 Int64  String7  Int64?  \n\u2500\u2500\u2500\u2500\u2500\u253c\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\n   1 \u2502     3  missing       77\n   2 \u2502     4  David         85\n   3 \u2502     5  Eva      missing \n</code></pre> <p>source</p> <p># <code>TidierFiles.read_delim</code> \u2014 Method.</p> <pre><code>read_delim(file; delim='    ',col_names=true, skip=0, n_max=Inf, \n    comment=nothing, missingstring=\"\", col_select, escape_double=true, col_types=nothing)\n</code></pre> <p>Reads a delimited file or URL into a DataFrame, with options to specify delimiter, column names, and other CSV parsing options.</p> <p>Arguments</p> <p><code>file</code>: Path to the CSV file or a URL to a CSV file. <code>delim</code>: The character delimiting fields in the file. Default is ','. <code>col_names</code>: Indicates if the first row of the CSV is used as column names. Can be true, false, or an array of strings. Default is true. <code>skip</code>: Number of initial lines to skip before reading data. Default is 0. <code>n_max</code>: Maximum number of rows to read. Default is Inf (read all rows). -<code>col_select</code>: Optional vector of symbols or strings to select which columns to load. <code>comment</code>: Character that starts a comment line. Lines beginning with this character are ignored. Default is nothing (no comment lines). <code>missingstring</code>: String that represents missing values in the CSV. Default is \"\", can be set to a vector of multiple items. <code>escape_double</code>: Indicates whether to interpret two consecutive quote characters as a single quote in the data. Default is true. <code>col_types</code>: An optional specification of column types, can be a single type applied to all columns, or a collection of types with one for each column. Default is nothing (types are inferred). <code>num_threads</code>: specifies the number of concurrent tasks or threads to use for processing, allowing for parallel execution. Default is the number of available threads.</p> <p>Examples</p> <pre><code>julia&gt; df = DataFrame(ID = 1:5, Name = [\"Alice\", \"Bob\", \"Charlie\", \"David\", \"Eva\"], Score = [88, 92, 77, 85, 95]);\n\njulia&gt; write_csv(df, \"csvtest.csv\");\n\njulia&gt; read_delim(\"csvtest.csv\", delim = \",\", col_names = false, num_threads = 4) # col_names are false here for the purpose of demonstration\n6\u00d73 DataFrame\n Row \u2502 Column1  Column2  Column3 \n     \u2502 String3  String7  String7 \n\u2500\u2500\u2500\u2500\u2500\u253c\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\n   1 \u2502 ID       Name     Score\n   2 \u2502 1        Alice    88\n   3 \u2502 2        Bob      92\n   4 \u2502 3        Charlie  77\n   5 \u2502 4        David    85\n   6 \u2502 5        Eva      95\n</code></pre> <p>source</p> <p># <code>TidierFiles.read_dta</code> \u2014 Method.</p> <pre><code>function read_dta(data_file;  encoding=nothing, col_select=nothing, skip=0, n_max=Inf)\n</code></pre> <p>Read data from a Stata (.dta) file into a DataFrame, supporting both local and remote sources.</p> <p>Arguments</p> <p>-<code>filepath</code>: The path to the .dta file or a URL pointing to such a file. If a URL is provided, the file will be downloaded and then read. <code>encoding</code>: Optional; specifies the encoding of the input file. If not provided, defaults to the package's or function's default. <code>col_select</code>: Optional; allows specifying a subset of columns to read. This can be a vector of column names or indices. If nothing, all columns are read. skip=0: Number of rows at the beginning of the file to skip before reading. n*max=Inf: Maximum number of rows to read from the file, after skipping. If Inf, read all available rows. <code>num*threads</code>: specifies the number of concurrent tasks or threads to use for processing, allowing for parallel execution. Defaults to 1</p> <p>Examples</p> <pre><code>julia&gt; df = DataFrame(AA=[\"sav\", \"por\"], AB=[10.1, 10.2]);\n\njulia&gt; write_dta(df, \"test.dta\");\n\njulia&gt; read_dta(\"test.dta\")\n2\u00d72 DataFrame\n Row \u2502 AA       AB      \n     \u2502 String3  Float64 \n\u2500\u2500\u2500\u2500\u2500\u253c\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\n   1 \u2502 sav         10.1\n   2 \u2502 por         10.2\n</code></pre> <p>source</p> <p># <code>TidierFiles.read_fwf</code> \u2014 Method.</p> <pre><code>read_fwf(filepath::String; num_lines::Int=4, col_names=nothing)\n</code></pre> <p>Read fixed-width format (FWF) files into a DataFrame.</p> <p>Arguments</p> <ul> <li><code>filepath</code>::String: Path to the FWF file to read.</li> <li><code>widths_colnames</code>::Tuple{Vector{Int}, Union{Nothing, Vector{String}}}: A tuple containing two elements:       - A vector of integers specifying the widths of each field.       - Optionally, a vector of strings specifying column names. If nothing, column names are generated as Column1, Column2, etc.</li> <li><code>skip_to</code>=0: Number of lines at the beginning of the file to skip before reading data.</li> <li><code>n_max</code>=nothing: Maximum number of lines to read from the file. If nothing, read all lines.</li> </ul> <p>Examples</p> <pre><code>julia&gt; fwf_data = \n       \"John Smith   35    12345  Software Engineer   120,000 \\nJane Doe     29     2345  Marketing Manager   95,000  \\nAlice Jones  42   123456  CEO                 250,000 \\nBob Brown    31    12345  Product Manager     110,000 \\nCharlie Day  28      345  Sales Associate     70,000  \\nDiane Poe    35    23456  Data Scientist      130,000 \\nEve Stone    40   123456  Chief Financial Off 200,000 \\nFrank Moore  33     1234  Graphic Designer    80,000  \\nGrace Lee    27   123456  Software Developer  115,000 \\nHank Zuse    45    12345  System Analyst      120,000 \";\n\njulia&gt; open(\"fwftest.txt\", \"w\") do file\n         write(file, fwf_data)\n       end;\n\njulia&gt; path = \"fwftest.txt\";\n\njulia&gt; read_fwf(path, fwf_empty(path, num_lines=4, col_names = [\"Name\", \"Age\", \"ID\", \"Position\", \"Salary\"]), skip_to=3, n_max=3)\n3\u00d75 DataFrame\n Row \u2502 Name         Age     ID      Position         Salary  \n     \u2502 String       String  String  String           String  \n\u2500\u2500\u2500\u2500\u2500\u253c\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\n   1 \u2502 Bob Brown    31      12345   Product Manager  110,000\n   2 \u2502 Charlie Day  28      345     Sales Associate  70,000\n   3 \u2502 Diane Poe    35      23456   Data Scientist   130,000\n</code></pre> <p>source</p> <p># <code>TidierFiles.read_sas</code> \u2014 Method.</p> <pre><code>function read_sas(data_file;  encoding=nothing, col_select=nothing, skip=0, n_max=Inf, num_threads)\n</code></pre> <p>Read data from a SAS (.sas7bdat and .xpt) file into a DataFrame, supporting both local and remote sources.</p> <p>Arguments</p> <p>-<code>filepath</code>: The path to the .dta file or a URL pointing to such a file. If a URL is provided, the file will be downloaded and then read. <code>encoding</code>: Optional; specifies the encoding of the input file. If not provided, defaults to the package's or function's default. <code>col_select</code>: Optional; allows specifying a subset of columns to read. This can be a vector of column names or indices. If nothing, all columns are read. skip=0: Number of rows at the beginning of the file to skip before reading. n*max=Inf: Maximum number of rows to read from the file, after skipping. If Inf, read all available rows. <code>num*threads</code>: specifies the number of concurrent tasks or threads to use for processing, allowing for parallel execution. Defaults to 1</p> <p>Examples</p> <p>```jldoctest julia&gt; df = DataFrame(AA=[\"sav\", \"por\"], AB=[10.1, 10.2]);</p> <p>julia&gt; write_sas(df, \"test.sas7bdat\");</p> <p>julia&gt; read_sas(\"test.sas7bdat\") 2\u00d72 DataFrame  Row \u2502 AA       AB            \u2502 String3  Float64  \u2500\u2500\u2500\u2500\u2500\u253c\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500    1 \u2502 sav         10.1    2 \u2502 por         10.2</p> <p>julia&gt; write_sas(df, \"test.xpt\");</p> <p>julia&gt; read_sas(\"test.xpt\") 2\u00d72 DataFrame  Row \u2502 AA       AB            \u2502 String3  Float64  \u2500\u2500\u2500\u2500\u2500\u253c\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500    1 \u2502 sav         10.1    2 \u2502 por         10.2</p> <p>source</p> <p># <code>TidierFiles.read_sav</code> \u2014 Method.</p> <pre><code>function read_sav(data_file;  encoding=nothing, col_select=nothing, skip=0, n_max=Inf)\n</code></pre> <p>Read data from a SPSS (.sav and .por) file into a DataFrame, supporting both local and remote sources.</p> <p>Arguments</p> <p>-<code>filepath</code>: The path to the .sav or .por file or a URL pointing to such a file. If a URL is provided, the file will be downloaded and then read. <code>encoding</code>: Optional; specifies the encoding of the input file. If not provided, defaults to the package's or function's default. <code>col_select</code>: Optional; allows specifying a subset of columns to read. This can be a vector of column names or indices. If nothing, all columns are read. skip=0: Number of rows at the beginning of the file to skip before reading. n*max=Inf: Maximum number of rows to read from the file, after skipping. If Inf, read all available rows. <code>num*threads</code>: specifies the number of concurrent tasks or threads to use for processing, allowing for parallel execution. Defaults to 1</p> <p>Examples</p> <pre><code>julia&gt; df = DataFrame(AA=[\"sav\", \"por\"], AB=[10.1, 10.2]);\n\njulia&gt; write_sav(df, \"test.sav\");\n\njulia&gt; read_sav(\"test.sav\")\n2\u00d72 DataFrame\n Row \u2502 AA      AB      \n     \u2502 String  Float64 \n\u2500\u2500\u2500\u2500\u2500\u253c\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\n   1 \u2502 sav        10.1\n   2 \u2502 por        10.2\n\njulia&gt; write_sav(df, \"test.por\");\n\njulia&gt; read_sav(\"test.por\")\n2\u00d72 DataFrame\n Row \u2502 AA      AB      \n     \u2502 String  Float64 \n\u2500\u2500\u2500\u2500\u2500\u253c\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\n   1 \u2502 sav        10.1\n   2 \u2502 por        10.2\n</code></pre> <p>source</p> <p># <code>TidierFiles.read_table</code> \u2014 Method.</p> <pre><code>read_table(file; col_names=true, skip=0, n_max=Inf, comment=nothing, col_select, missingstring=\"\", kwargs...)\n</code></pre> <p>Read a table from a file where columns are separated by any amount of whitespace, processing it into a DataFrame.</p> <p>Arguments</p> <p>-<code>file</code>: The path to the file to read. -<code>col_names</code>=true: Indicates whether the first non-skipped line should be treated as column names. If false, columns are named automatically. -<code>skip</code>: Number of lines at the beginning of the file to skip before processing starts. -<code>n_max</code>: The maximum number of lines to read from the file, after skipping. Inf means read all lines. -<code>col_select</code>: Optional vector of symbols or strings to select which columns to load. -<code>comment</code>: A character or string indicating the start of a comment. Lines starting with this character are ignored. -<code>missingstring</code>: The string that represents missing values in the table. -<code>kwargs</code>: Additional keyword arguments passed to CSV.File.</p> <p>Examples</p> <pre><code>julia&gt; df = DataFrame(ID = 1:5, Name = [\"Alice\", \"Bob\", \"Charlie\", \"David\", \"Eva\"], Score = [88, 92, 77, 85, 95]);\n\njulia&gt; write_table(df, \"tabletest.txt\");\n\njulia&gt; read_table(\"tabletest.txt\", skip = 2, n_max = 3, col_select = [\"Name\"])\n3\u00d71 DataFrame\n Row \u2502 Name    \n     \u2502 String7 \n\u2500\u2500\u2500\u2500\u2500\u253c\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\n   1 \u2502 Charlie\n   2 \u2502 David\n   3 \u2502 Eva\n</code></pre> <p>source</p> <p># <code>TidierFiles.read_tsv</code> \u2014 Method.</p> <pre><code>read_tsv(file; delim='  ',col_names=true, skip=0, n_max=Inf, \n    comment=nothing, missingstring=\"\", col_select, escape_double=true, col_types=nothing)\n</code></pre> <p>Reads a TSV file or URL into a DataFrame, with options to specify delimiter, column names, and other CSV parsing options.</p> <p>Arguments</p> <p><code>file</code>: Path to the TSV file or a URL to a TSV file. <code>delim</code>: The character delimiting fields in the file. Default is ','. <code>col_names</code>: Indicates if the first row of the CSV is used as column names. Can be true, false, or an array of strings. Default is true. <code>skip</code>: Number of initial lines to skip before reading data. Default is 0. <code>n_max</code>: Maximum number of rows to read. Default is Inf (read all rows). -<code>col_select</code>: Optional vector of symbols or strings to select which columns to load. <code>comment</code>: Character that starts a comment line. Lines beginning with this character are ignored. Default is nothing (no comment lines). <code>missingstring</code>: String that represents missing values in the CSV. Default is \"\", can be set to a vector of multiple items. <code>escape_double</code>: Indicates whether to interpret two consecutive quote characters as a single quote in the data. Default is true. <code>num_threads</code>: specifies the number of concurrent tasks or threads to use for processing, allowing for parallel execution. Default is the number of available threads.</p> <p>Examples</p> <pre><code>julia&gt; df = DataFrame(ID = 1:5, Name = [\"Alice\", \"Bob\", \"Charlie\", \"David\", \"Eva\"], Score = [88, 92, 77, 85, 95]);\n\njulia&gt; write_tsv(df, \"tsvtest.tsv\");\n\njulia&gt; read_tsv(\"tsvtest.tsv\", skip = 2, n_max = 3, missingstring = [\"Charlie\"])\n3\u00d73 DataFrame\n Row \u2502 ID     Name     Score \n     \u2502 Int64  String7  Int64 \n\u2500\u2500\u2500\u2500\u2500\u253c\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\n   1 \u2502     3  missing     77\n   2 \u2502     4  David       85\n   3 \u2502     5  Eva         95\n</code></pre> <p>source</p> <p># <code>TidierFiles.read_xlsx</code> \u2014 Method.</p> <pre><code>read_xlsx(path; sheet, range, col_names, col_types, missingstring, trim_ws, skip, n_max, guess_max)\n</code></pre> <p>Read data from an Excel file into a DataFrame.</p> <p>Arguments</p> <p>-<code>path</code>: The path to the Excel file to be read. -<code>sheet</code>: Specifies the sheet to be read. Can be either the name of the sheet as a string or its index as an integer. If nothing, the first sheet is read. -<code>range</code>: Specifies a specific range of cells to be read from the sheet. If nothing, the entire sheet is read. -<code>col_names</code>: Indicates whether the first row of the specified range should be treated as column names. If false, columns will be named automatically. -<code>col_types</code>: Allows specifying column types explicitly. Can be a single type applied to all columns, a list or a dictionary mapping column names or indices to types. If nothing, types will be inferred. -<code>missingstring</code>: The value or vector that represents missing values in the Excel file. -<code>trim_ws</code>: Whether to trim leading and trailing whitespace from cells in the Excel file. -<code>skip</code>: Number of rows to skip at the beginning of the sheet or range before reading data. -<code>n_max</code>: The maximum number of rows to read from the sheet or range, after skipping. Inf means read all available rows. -<code>guess_max</code>: The maximum number of rows to scan for type guessing and column names detection. Only relevant if coltypes is nothing or colnames is true. If nothing, a default heuristic is used.</p> <p>Examples</p> <pre><code>julia&gt; df = DataFrame(integers=[1, 2, 3, 4],\n       strings=[\"This\", \"Package makes\", \"File reading/writing\", \"even smoother\"],\n       floats=[10.2, 20.3, 30.4, 40.5]);\n\njulia&gt; df2 = DataFrame(AA=[\"aa\", \"bb\"], AB=[10.1, 10.2]);\n\njulia&gt; write_xlsx((\"REPORT_A\" =&gt; df, \"REPORT_B\" =&gt; df2); path=\"xlsxtest.xlsx\", overwrite = true);\n\njulia&gt; read_xlsx(\"xlsxtest.xlsx\", sheet = \"REPORT_A\", skip = 1, n_max = 4, missingstring = [2])\n3\u00d73 DataFrame\n Row \u2502 integers  strings               floats  \n     \u2502 Any       String                Float64 \n\u2500\u2500\u2500\u2500\u2500\u253c\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\n   1 \u2502 missing   Package makes            20.3\n   2 \u2502 3         File reading/writing     30.4\n   3 \u2502 4         even smoother            40.5\n</code></pre> <p>source</p> <p># <code>TidierFiles.write_csv</code> \u2014 Method.</p> <pre><code>write_csv(DataFrame, filepath; na = \"\", append = false, col_names = true, missingstring, eol = \"\n</code></pre> <p>\", num_threads = Threads.nthreads()) Write a DataFrame to a CSV (comma-separated values) file.</p> <p>Arguments</p> <ul> <li><code>x</code>: The DataFrame to write to the CSV file.</li> <li><code>file</code>: The path to the output CSV file.</li> <li><code>missingstring</code>: = \"\": The string to represent missing values in the output file. Default is an empty string.</li> <li><code>append</code>: Whether to append to the file if it already exists. Default is false.</li> <li><code>col_names</code>: = true: Whether to write column names as the first line of the file. Default is true.</li> <li><code>eol</code>: = \"</li> </ul> <p>\": The end-of-line character to use in the output file. Default is the newline character.</p> <ul> <li><code>num_threads</code> = Threads.nthreads(): The number of threads to use for writing the file. Default is the number of available threads.</li> </ul> <p>Examples</p> <pre><code>julia&gt; df = DataFrame(ID = 1:5, Name = [\"Alice\", \"Bob\", \"Charlie\", \"David\", \"Eva\"], Score = [88, 92, 77, 85, 95]);\n\njulia&gt; write_csv(df, \"csvtest.csv\");\n</code></pre> <p>source</p> <p># <code>TidierFiles.write_dta</code> \u2014 Method.</p> <pre><code>write_dta(df, path)\n</code></pre> <p>Write a DataFrame to a Stata (.dta) file.</p> <p>Arguments -<code>df</code>: The DataFrame to be written to a file. -<code>path</code>: String as path where the .dta file will be created. If a file at this path already exists, it will be overwritten.</p> <p>Examples</p> <pre><code>julia&gt; df = DataFrame(AA=[\"sav\", \"por\"], AB=[10.1, 10.2]);\n\njulia&gt; write_dta(df, \"test.dta\")\n2\u00d72 ReadStatTable:\n Row \u2502     AA        AB \n     \u2502 String  Float64? \n\u2500\u2500\u2500\u2500\u2500\u253c\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\n   1 \u2502    sav      10.1\n   2 \u2502    por      10.2\n</code></pre> <p>source</p> <p># <code>TidierFiles.write_sas</code> \u2014 Method.</p> <pre><code>write_sas(df, path)\n</code></pre> <p>Write a DataFrame to a SAS (.sas7bdat or .xpt) file.</p> <p>Arguments -<code>df</code>: The DataFrame to be written to a file. -<code>path</code>: String as path where the .dta file will be created. If a file at this path already exists, it will be overwritten.</p> <p>Examples</p> <pre><code>julia&gt; df = DataFrame(AA=[\"sav\", \"por\"], AB=[10.1, 10.2]);\n\njulia&gt; write_sas(df, \"test.sas7bdat\")\n2\u00d72 ReadStatTable:\n Row \u2502     AA        AB \n     \u2502 String  Float64? \n\u2500\u2500\u2500\u2500\u2500\u253c\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\n   1 \u2502    sav      10.1\n   2 \u2502    por      10.2\n\njulia&gt; write_sas(df, \"test.xpt\")\n2\u00d72 ReadStatTable:\n Row \u2502     AA        AB \n     \u2502 String  Float64? \n\u2500\u2500\u2500\u2500\u2500\u253c\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\n   1 \u2502    sav      10.1\n   2 \u2502    por      10.2\n</code></pre> <p>source</p> <p># <code>TidierFiles.write_sav</code> \u2014 Method.</p> <pre><code>write_sav(df, path)\n</code></pre> <p>Write a DataFrame to a SPSS (.sav or .por) file.</p> <p>Arguments -<code>df</code>: The DataFrame to be written to a file. -<code>path</code>: String as path where the .dta file will be created. If a file at this path already exists, it will be overwritten.</p> <p>Examples</p> <pre><code>julia&gt; df = DataFrame(AA=[\"sav\", \"por\"], AB=[10.1, 10.2]);\n\njulia&gt; write_sav(df, \"test.sav\")\n2\u00d72 ReadStatTable:\n Row \u2502     AA        AB \n     \u2502 String  Float64? \n\u2500\u2500\u2500\u2500\u2500\u253c\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\n   1 \u2502    sav      10.1\n   2 \u2502    por      10.2\n\njulia&gt; write_sav(df, \"test.por\")\n2\u00d72 ReadStatTable:\n Row \u2502     AA        AB \n     \u2502 String  Float64? \n\u2500\u2500\u2500\u2500\u2500\u253c\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\n   1 \u2502    sav      10.1\n   2 \u2502    por      10.2\n</code></pre> <p>source</p> <p># <code>TidierFiles.write_table</code> \u2014 Method.</p> <pre><code>write_table(x, file; delim = '  ', na, append, col_names, eol, num_threads)\n</code></pre> <p>Write a DataFrame to a file, allowing for customization of the delimiter and other options.</p> <p>Arguments</p> <p>-<code>x</code>: The DataFrame to write to a file. -<code>file</code>: The path to the file where the DataFrame will be written. -delim: Character to use as the field delimiter. The default is tab ('   '), making it a TSV (tab-separated values) file by default, but can be changed to accommodate other formats. -<code>missingstring</code>: The string to represent missing data in the output file. -<code>append</code>: Whether to append to the file if it already exists. If false, the file will be overwritten. -<code>col_names</code>: Whether to write column names as the first line of the file. If appending to an existing file with append = true, column names will not be written regardless of this parameter's value. -<code>eol</code>: The end-of-line character to use in the file. Defaults to \" \". -<code>num_threads</code>: Number of threads to use for writing the file. Uses the number of available Julia threads by default.</p> <p>Examples</p> <pre><code>julia&gt; df = DataFrame(ID = 1:5, Name = [\"Alice\", \"Bob\", \"Charlie\", \"David\", \"Eva\"], Score = [88, 92, 77, 85, 95]);\n\njulia&gt; write_table(df, \"tabletest.txt\");\n</code></pre> <p>source</p> <p># <code>TidierFiles.write_tsv</code> \u2014 Method.</p> <pre><code>write_tsv(DataFrame, filepath; na = \"\", append = false, col_names = true, missingstring, eol = \"\n</code></pre> <p>\", num_threads = Threads.nthreads()) Write a DataFrame to a TSV (tab-separated values) file.</p> <p>Arguments</p> <ul> <li><code>x</code>: The DataFrame to write to the TSV file.</li> <li><code>file</code>: The path to the output TSV file.</li> <li><code>missingstring</code>: = \"\": The string to represent missing values in the output file. Default is an empty string.</li> <li><code>append</code>: Whether to append to the file if it already exists. Default is false.</li> <li><code>col_names</code>: = true: Whether to write column names as the first line of the file. Default is true.</li> <li><code>eol</code>: = \"</li> </ul> <p>\": The end-of-line character to use in the output file. Default is the newline character.</p> <ul> <li><code>num_threads</code> = Threads.nthreads(): The number of threads to use for writing the file. Default is the number of available threads.</li> </ul> <p>Examples</p> <pre><code>julia&gt; df = DataFrame(ID = 1:5, Name = [\"Alice\", \"Bob\", \"Charlie\", \"David\", \"Eva\"], Score = [88, 92, 77, 85, 95]);\n\njulia&gt; write_tsv(df, \"tsvtest.tsv\");\n</code></pre> <p>source</p> <p># <code>TidierFiles.write_xlsx</code> \u2014 Method.</p> <pre><code>write_xlsx(x; path, overwrite)\n</code></pre> <p>Write a DataFrame, or multiple DataFrames, to an Excel file.</p>"},{"location":"reference/#arguments-x-the-data-to-write-can-be-a-single-pairstring-dataframe-for-writing-one-sheet-or-a-tuple-of-such-pairs-for-writing-multiple-sheets-the-string-in-each-pair-specifies-the-sheet-name-and-the-dataframe-is-the-data-to-write-to-that-sheet-path-the-path-to-the-excel-file-where-the-data-will-be-written-overwrite-defaults-to-false-whether-to-overwrite-an-existing-file-if-false-an-error-is-thrown-when-attempting-to-write-to-an-existing-file","title":"Arguments -<code>x</code>: The data to write. Can be a single Pair{String, DataFrame} for writing one sheet, or a Tuple of such pairs for writing multiple sheets. The String in each pair specifies the sheet name, and the DataFrame is the data to write to that sheet. -<code>path</code>: The path to the Excel file where the data will be written. -<code>overwrite</code>: Defaults to false. Whether to overwrite an existing file. If false, an error is thrown when attempting to write to an existing file.","text":"<p>Examples</p> <pre><code>julia&gt; df = DataFrame(integers=[1, 2, 3, 4],\n       strings=[\"This\", \"Package makes\", \"File reading/writing\", \"even smoother\"],\n       floats=[10.2, 20.3, 30.4, 40.5]);\n\njulia&gt; df2 = DataFrame(AA=[\"aa\", \"bb\"], AB=[10.1, 10.2]);\n\njulia&gt; write_xlsx((\"REPORT_A\" =&gt; df, \"REPORT_B\" =&gt; df2); path=\"xlsxtest.xlsx\", overwrite = true);\n</code></pre> <p>source</p> <p></p> <p></p>"},{"location":"reference/#reference-internal-functions","title":"Reference - Internal functions","text":""},{"location":"examples/generated/UserGuide/delim/","title":"Delimited Files","text":"<p>The goal of reading and writing throughout TidierFiles.jl is to use consistent syntax. This functions on this page focus on delimited files and are powered by CSV.jl.</p> <pre><code>using TidierFiles\n</code></pre> <p></p> <p></p>"},{"location":"examples/generated/UserGuide/delim/#read_csvtsvdelim","title":"read_csv/tsv/delim","text":"<pre><code>read_csv(\"https://raw.githubusercontent.com/TidierOrg/TidierFiles.jl/main/testing_files/csvtest.csv\", skip = 2, n_max = 3, col_select = [\"ID\", \"Score\"], missingstring = [\"4\"])\n\n#read_csv(file; delim=',', col_names=true, skip=0, n_max=Inf, comment=nothing, missingstring=\"\", col_select=nothing, escape_double=true, col_types=nothing, num_threads=1)\n\n#read_tsv(file; delim='\\t', col_names=true, skip=0, n_max=Inf, comment=nothing, missingstring=\"\", col_select=nothing, escape_double=true, col_types=nothing, num_threads=Threads.nthreads())\n\n#read_delim(file; delim='\\t', col_names=true, skip=0, n_max=Inf, comment=nothing, missingstring=\"\", col_select=nothing, escape_double=true, col_types=nothing, num_threads=Threads.nthreads())\n\n#These functions read a delimited file (CSV, TSV, or custom delimiter) into a DataFrame. The arguments are:\n</code></pre> 3\u00d72 DataFrame RowIDScoreInt64?Int6413772missing853595 <ul> <li><code>file</code>: Path to the file or a URL.</li> <li><code>delim</code>: Field delimiter. Default is ',' for <code>read_csv</code>, '\\t' for <code>read_tsv</code> and <code>read_delim</code>.</li> <li><code>col_names</code>: Use first row as column names. Can be <code>true</code>, <code>false</code>, or an array of strings. Default is <code>true</code>.</li> <li><code>skip</code>: Number of lines to skip before reading data. Default is 0.</li> <li><code>n_max</code>: Maximum number of rows to read. Default is <code>Inf</code> (read all rows).</li> <li><code>comment</code>: Character indicating comment lines to ignore. Default is <code>nothing</code>.</li> <li><code>missingstring</code>: String(s) representing missing values. Default is <code>\"\"</code>.</li> <li><code>col_select</code>: Optional vector of symbols or strings to select columns to load. Default is <code>nothing</code>.</li> <li><code>escape_double</code>: Interpret two consecutive quote characters as a single quote. Default is <code>true</code>.</li> <li><code>col_types</code>: Optional specification of column types. Default is <code>nothing</code> (types are inferred).</li> <li><code>num_threads</code>: Number of threads to use for parallel execution. Default is 1 for <code>read_csv</code> and the number of available threads for <code>read_tsv</code> and <code>read_delim</code>.</li> </ul> <p>The functions return a DataFrame containing the parsed data from the file.</p> <p></p> <p></p>"},{"location":"examples/generated/UserGuide/delim/#write_csv-and-write_tsv","title":"<code>write_csv</code> and # ## <code>write_tsv</code>","text":"<p>writecsv(x, file; missingstring=\"\", append=false, colnames=true, eol=\"\\n\", num_threads=Threads.nthreads())</p> <p>writetsv(x, file; missingstring=\"\", append=false, colnames=true, eol=\"\\n\", num_threads=Threads.nthreads())</p> <p>These functions write a DataFrame to a CSV or TSV file. The arguments are:</p> <ul> <li><code>x</code>: The DataFrame to write.</li> <li><code>file</code>: The path to the output file.</li> <li><code>missingstring</code>: The string to represent missing values. Default is an empty string.</li> <li><code>append</code>: Whether to append to an existing file. Default is <code>false</code>.</li> <li><code>col_names</code>: Whether to write column names as the first line. Default is <code>true</code>.</li> <li><code>eol</code>: The end-of-line character. Default is <code>\"\\n\"</code>.</li> <li><code>num_threads</code>: The number of threads to use for writing. Default is the number of available threads.</li> </ul> <p></p> <p></p>"},{"location":"examples/generated/UserGuide/delim/#read_table","title":"<code>read_table</code>","text":"<p>readtable(file; colnames=true, skip=0, nmax=Inf, comment=nothing, colselect=nothing, missingstring=\"\", num_threads)</p> <p>This function reads a table from a whitespace-delimited file into a DataFrame. The arguments are:</p> <ul> <li><code>file</code>: The path to the file to read.</li> <li><code>col_names</code>: Whether the first non-skipped line contains column names. Default is <code>true</code>.</li> <li><code>skip</code>: Number of lines to skip before processing. Default is 0.</li> <li><code>n_max</code>: Maximum number of lines to read. Default is <code>Inf</code> (read all lines).</li> <li><code>comment</code>: Character or string indicating comment lines to ignore. Default is <code>nothing</code>.</li> <li><code>col_select</code>: Optional vector of symbols or strings to select columns to load. Default is <code>nothing</code>.</li> <li><code>missingstring</code>: The string representing missing values. Default is <code>\"\"</code>.</li> <li><code>num_threads</code>: The number of threads to use for writing. Default is the number of available threads.</li> </ul> <p></p> <p></p>"},{"location":"examples/generated/UserGuide/delim/#write_table","title":"<code>write_table</code>","text":"<p>writetable(x, file; delim='\\t', missingstring=\"\", append=false, colnames=true, eol=\"\\n\", num_threads=Threads.nthreads())</p> <p>This function writes a DataFrame to a file with customizable delimiter and options. The arguments are:</p> <ul> <li><code>x</code>: The DataFrame to write.</li> <li><code>file</code>: The path to the output file.</li> <li><code>delim</code>: The field delimiter. Default is <code>'\\t'</code> (tab-separated).</li> <li><code>missingstring</code>: The string to represent missing values. Default is <code>\"\"</code>.</li> <li><code>append</code>: Whether to append to an existing file. Default is <code>false</code>.</li> <li><code>col_names</code>: Whether to write column names as the first line. Default is <code>true</code>.</li> <li><code>eol</code>: The end-of-line character. Default is <code>\"\\n\"</code>.</li> <li><code>num_threads</code>: The number of threads to use for writing. Default is the number of available threads.</li> </ul> <p>This page was generated using Literate.jl.</p>"},{"location":"examples/generated/UserGuide/stats/","title":"Stats Files","text":"<p>The functions for reading and writing stats files are made possible by ReadStatTables.jl</p> <p></p> <p></p>"},{"location":"examples/generated/UserGuide/stats/#reading-stats-files","title":"reading stats files","text":"<p>readdta(filepath; encoding=nothing, colselect=nothing, skip=0, nmax=Inf, numthreads=1) readsas(filepath; encoding=nothing, colselect=nothing, skip=0, nmax=Inf, numthreads=1) readsav(filepath; encoding=nothing, colselect=nothing, skip=0, nmax=Inf, numthreads=1)</p> <p>These functions read data from Stata (.dta), SAS (.sas7bdat and .xpt), and SPSS (.sav and .por) files into a DataFrame. The arguments are:</p> <ul> <li><code>filepath</code>: The path to the file or a URL pointing to the file. If a URL is provided, the file will be downloaded and then read.</li> <li><code>encoding</code>: Optional; specifies the encoding of the input file. Default is the package's or function's default.</li> <li><code>col_select</code>: Optional; allows specifying a subset of columns to read. Can be a vector of column names or indices. Default is <code>nothing</code> (all columns are read).</li> <li><code>skip</code>: Number of rows to skip at the beginning of the file. Default is 0.</li> <li><code>n_max</code>: Maximum number of rows to read after skipping. Default is <code>Inf</code> (read all rows).</li> <li><code>num_threads</code>: Number of concurrent tasks or threads to use for processing. Default is 1.</li> </ul> <p></p> <p></p>"},{"location":"examples/generated/UserGuide/stats/#writing-stats-files","title":"writing stats files","text":"<p>writesav(df, path) writesas(df, path) write_dta(df, path)</p> <p>These functions write a DataFrame to SPSS (.sav or .por), SAS (.sas7bdat or .xpt), and Stata (.dta) files. The arguments are:</p> <ul> <li><code>df</code>: The DataFrame to be written to a file.</li> <li><code>path</code>: The path where the file will be created. If a file at this path already exists, it will be overwritten.</li> </ul> <p>This page was generated using Literate.jl.</p>"},{"location":"examples/generated/UserGuide/xl/","title":"Excel Files","text":"<p>Reading and writing XLSX files are made possible by XLSX.jl</p> <p></p> <p></p>"},{"location":"examples/generated/UserGuide/xl/#read_xlsx","title":"<code>read_xlsx</code>","text":"<p>readxlsx(path; sheet=nothing, range=nothing, colnames=true, coltypes=nothing, missingstring=\"\", trimws=true, skip=0, nmax=Inf, guessmax=nothing)</p> <p>This function reads data from an Excel file into a DataFrame. The arguments are:</p> <ul> <li><code>path</code>: The path or URL to the Excel file to be read.</li> <li><code>sheet</code>: The sheet to be read. Can be a sheet name (string) or index (integer). Default is the first sheet.</li> <li><code>range</code>: A specific range of cells to be read from the sheet. Default is the entire sheet.</li> <li><code>col_names</code>: Whether the first row of the range contains column names. Default is <code>true</code>.</li> <li><code>col_types</code>: Explicit specification of column types. Can be a single type, a list, or a dictionary mapping column names or indices to types. Default is <code>nothing</code> (types are inferred).</li> <li><code>missingstring</code>: The string representing missing values. Default is <code>\"\"</code>.</li> <li><code>trim_ws</code>: Whether to trim leading and trailing whitespace from cells. Default is <code>true</code>.</li> <li><code>skip</code>: Number of rows to skip before reading data. Default is 0.</li> <li><code>n_max</code>: Maximum number of rows to read. Default is <code>Inf</code> (read all rows).</li> <li><code>guess_max</code>: Maximum number of rows to scan for type guessing and column names detection. Default is <code>nothing</code> (a default heuristic is used).</li> </ul> <p></p> <p></p>"},{"location":"examples/generated/UserGuide/xl/#write_xlsx","title":"<code>write_xlsx</code>","text":"<p>write_xlsx(x; path, overwrite=false)</p> <p>This function writes a DataFrame, or multiple DataFrames, to an Excel file. The arguments are:</p> <ul> <li><code>x</code>: The data to write. Can be a single <code>Pair{String, DataFrame}</code> for writing one sheet, or a <code>Tuple</code> of such pairs for writing multiple sheets. The <code>String</code> in each pair specifies the sheet name, and the <code>DataFrame</code> is the data to write to that sheet.</li> <li><code>path</code>: The path to the output Excel file.</li> <li><code>overwrite</code>: Whether to overwrite an existing file. Default is <code>false</code>.</li> </ul> <p>This page was generated using Literate.jl.</p>"}]}
\ No newline at end of file
diff --git a/previews/PR4/sitemap.xml.gz b/previews/PR4/sitemap.xml.gz
index ea1ade2..7e678cc 100644
Binary files a/previews/PR4/sitemap.xml.gz and b/previews/PR4/sitemap.xml.gz differ