Skip to content

Commit

Permalink
more
Browse files Browse the repository at this point in the history
  • Loading branch information
gagolews committed Nov 13, 2023
1 parent aa70bee commit 0f1cf65
Show file tree
Hide file tree
Showing 127 changed files with 180 additions and 176 deletions.
16 changes: 8 additions & 8 deletions .devel/sphinx/news.md
Original file line number Diff line number Diff line change
Expand Up @@ -16,11 +16,11 @@
* [BACKWARD INCOMPATIBILITY] Removed the long-deprecated and defunct
`fallback_encoding` parameter of `stri_read_lines` and the ellipsis
parameter of `stri_opts_collator`, `stri_opts_regex`, `stri_opts_fixed`,
and `stri_opts_regex`.
`stri_opts_brkiter`, and `stri_opts_regex`.

* [BUILD TIME] As per the suggestion of Prof. Brian Ripley, `icudt74l`
(ICU data - little endian) is now included in the source tarball (compressed
with xz to save space). This allows for building *stringi* on systems with
with xz to save space). This allows for building **`stringi`** on systems with
no internet access.

* [NEW FEATURE] #476: In break iterator-, date-time-, and collator-based
Expand Down Expand Up @@ -53,7 +53,7 @@

## 1.7.8 (2022-07-11)

* [DOCUMENTATION] Paper on *stringi* has been published in
* [DOCUMENTATION] Paper on **`stringi`** has been published in
the *Journal of Statistical Software*;
see <https://doi.org/10.18637/jss.v103.i02>.

Expand All @@ -77,12 +77,12 @@

## 1.7.5 (2021-10-04)

* [DOCUMENTATION] Paper on *stringi* has been accepted for
* [DOCUMENTATION] Paper on **`stringi`** has been accepted for
publication in the *Journal of Statistical Software*,
see <https://stringi.gagolewski.com/_static/vignette/stringi.pdf>
for a draft version.

* [DOCUMENTATION] The *stringi* website at <https://stringi.gagolewski.com/>
* [DOCUMENTATION] The **`stringi`** website at <https://stringi.gagolewski.com/>
now features a comprehensive tutorial based on the aforementioned paper.

* [DOCUMENTATION] The *ICU* Project site has been moved to
Expand Down Expand Up @@ -222,8 +222,8 @@
stringi will be built against the battle-tested ICU4C 55.1.
The update to ICU brings Unicode 13.0 and CLDR 39 support.

* [DOCUMENTATION] A draft version of a paper on `stringi` is now available at
<https://stringi.gagolewski.com/_static/vignette/stringi.pdf>.
* [DOCUMENTATION] A draft version of a paper on **`stringi`** is now available
at <https://stringi.gagolewski.com/_static/vignette/stringi.pdf>.

* [GENERAL] stringi now requires R >= 3.1 (`CXX_STD` of `CXX11` or `CXX1X`).

Expand Down Expand Up @@ -699,7 +699,7 @@ documentation object `stri_datetime_format`: `...`
* [NEW FEATURE] #117: `stri_extract_all_*()`, `stri_locate_all_*()`,
`stri_match_all_*()` gained a new argument: `omit_no_match`.
Setting it to `TRUE` makes these functions compatible with their
`stringr` equivalents.
**`stringr`** equivalents.

* [NEW FEATURE] #118: `stri_wrap()` gained `indent`, `exdent`, `initial`,
and `prefix` arguments. Moreover, Knuth's dynamic word wrapping algorithm
Expand Down
4 changes: 2 additions & 2 deletions .devel/sphinx/rapi/stri_datetime_add.md
Original file line number Diff line number Diff line change
Expand Up @@ -68,15 +68,15 @@ print(x)
```

```
## [1] "2024-01-09 12:12:28 AEDT"
## [1] "2024-01-14 08:45:06 AEDT"
```

```r
stri_datetime_add(x, -2, units='months')
```

```
## [1] "2023-11-09 12:12:28 AEDT"
## [1] "2023-11-14 08:45:06 AEDT"
```

```r
Expand Down
2 changes: 1 addition & 1 deletion .devel/sphinx/rapi/stri_datetime_create.md
Original file line number Diff line number Diff line change
Expand Up @@ -96,5 +96,5 @@ stri_datetime_create(hour=15, minute=59)
```

```
## [1] "2023-11-09 15:59:00 AEDT"
## [1] "2023-11-14 15:59:00 AEDT"
```
10 changes: 5 additions & 5 deletions .devel/sphinx/rapi/stri_datetime_fields.md
Original file line number Diff line number Diff line change
Expand Up @@ -77,9 +77,9 @@ stri_datetime_fields(stri_datetime_now())

```
## Year Month Day Hour Minute Second Millisecond WeekOfYear WeekOfMonth
## 1 2023 11 9 12 12 28 375 46 2
## 1 2023 11 14 8 45 7 253 47 3
## DayOfYear DayOfWeek Hour12 AmPm Era
## 1 313 5 0 2 2
## 1 318 3 8 1 2
```

```r
Expand All @@ -88,9 +88,9 @@ stri_datetime_fields(stri_datetime_now(), locale='@calendar=hebrew')

```
## Year Month Day Hour Minute Second Millisecond WeekOfYear WeekOfMonth
## 1 5784 2 25 12 12 28 380 9 4
## 1 5784 3 1 8 45 7 258 10 1
## DayOfYear DayOfWeek Hour12 AmPm Era
## 1 55 5 0 2 1
## 1 60 3 8 1 1
```

```r
Expand All @@ -100,5 +100,5 @@ stri_datetime_symbols(locale='@calendar=hebrew')$Month[
```

```
## [1] "Heshvan"
## [1] "Kislev"
```
2 changes: 1 addition & 1 deletion .devel/sphinx/rapi/stri_datetime_format.md
Original file line number Diff line number Diff line change
Expand Up @@ -221,5 +221,5 @@ stri_datetime_format(stri_datetime_now(), 'datetime_relative_medium')
```

```
## [1] "today, 12:12:28 pm"
## [1] "today, 8:45:07 am"
```
4 changes: 2 additions & 2 deletions .devel/sphinx/rapi/stri_sprintf.md
Original file line number Diff line number Diff line change
Expand Up @@ -188,7 +188,7 @@ stri_sprintf("UNIX time %1$f is %1$s.", Sys.time())
```

```
## [1] "UNIX time 1699492358.726822 is 2023-11-09 12:12:38.726822."
## [1] "UNIX time 1699911917.272750 is 2023-11-14 08:45:17.27275."
```

```r
Expand All @@ -213,7 +213,7 @@ stri_sprintf("%1$s is %1$f UNIX time.", Sys.time()) # re-coercion needed
```

```
## [1] "2023-11-09 12:12:38.728535 is 1699492358.728535 UNIX time."
## [1] "2023-11-14 08:45:17.274441 is 1699911917.274441 UNIX time."
```

```r
Expand Down
16 changes: 8 additions & 8 deletions NEWS
Original file line number Diff line number Diff line change
Expand Up @@ -16,11 +16,11 @@
* [BACKWARD INCOMPATIBILITY] Removed the long-deprecated and defunct
`fallback_encoding` parameter of `stri_read_lines` and the ellipsis
parameter of `stri_opts_collator`, `stri_opts_regex`, `stri_opts_fixed`,
and `stri_opts_regex`.
`stri_opts_brkiter`, and `stri_opts_regex`.

* [BUILD TIME] As per the suggestion of Prof. Brian Ripley, `icudt74l`
(ICU data - little endian) is now included in the source tarball (compressed
with xz to save space). This allows for building *stringi* on systems with
with xz to save space). This allows for building **`stringi`** on systems with
no internet access.

* [NEW FEATURE] #476: In break iterator-, date-time-, and collator-based
Expand Down Expand Up @@ -53,7 +53,7 @@

## 1.7.8 (2022-07-11)

* [DOCUMENTATION] Paper on *stringi* has been published in
* [DOCUMENTATION] Paper on **`stringi`** has been published in
the *Journal of Statistical Software*;
see <https://doi.org/10.18637/jss.v103.i02>.

Expand All @@ -77,12 +77,12 @@

## 1.7.5 (2021-10-04)

* [DOCUMENTATION] Paper on *stringi* has been accepted for
* [DOCUMENTATION] Paper on **`stringi`** has been accepted for
publication in the *Journal of Statistical Software*,
see <https://stringi.gagolewski.com/_static/vignette/stringi.pdf>
for a draft version.

* [DOCUMENTATION] The *stringi* website at <https://stringi.gagolewski.com/>
* [DOCUMENTATION] The **`stringi`** website at <https://stringi.gagolewski.com/>
now features a comprehensive tutorial based on the aforementioned paper.

* [DOCUMENTATION] The *ICU* Project site has been moved to
Expand Down Expand Up @@ -222,8 +222,8 @@
stringi will be built against the battle-tested ICU4C 55.1.
The update to ICU brings Unicode 13.0 and CLDR 39 support.

* [DOCUMENTATION] A draft version of a paper on `stringi` is now available at
<https://stringi.gagolewski.com/_static/vignette/stringi.pdf>.
* [DOCUMENTATION] A draft version of a paper on **`stringi`** is now available
at <https://stringi.gagolewski.com/_static/vignette/stringi.pdf>.

* [GENERAL] stringi now requires R >= 3.1 (`CXX_STD` of `CXX11` or `CXX1X`).

Expand Down Expand Up @@ -699,7 +699,7 @@ documentation object `stri_datetime_format`: `...`
* [NEW FEATURE] #117: `stri_extract_all_*()`, `stri_locate_all_*()`,
`stri_match_all_*()` gained a new argument: `omit_no_match`.
Setting it to `TRUE` makes these functions compatible with their
`stringr` equivalents.
**`stringr`** equivalents.

* [NEW FEATURE] #118: `stri_wrap()` gained `indent`, `exdent`, `initial`,
and `prefix` arguments. Moreover, Knuth's dynamic word wrapping algorithm
Expand Down
8 changes: 6 additions & 2 deletions R/search_extract_bound.R
Original file line number Diff line number Diff line change
Expand Up @@ -126,8 +126,12 @@ stri_extract_first_boundaries <- function(str, ..., opts_brkiter = NULL)
stri_extract_all_words <- function(str, simplify = FALSE, omit_no_match = FALSE,
locale = NULL)
{
stri_extract_all_boundaries(str, simplify, omit_no_match, opts_brkiter = stri_opts_brkiter(type = "word",
skip_word_none = TRUE, locale = locale))
stri_extract_all_boundaries(
str, simplify, omit_no_match,
opts_brkiter = stri_opts_brkiter(
type = "word", skip_word_none = TRUE, locale = locale
)
)
}


Expand Down
18 changes: 9 additions & 9 deletions README.md
Original file line number Diff line number Diff line change
@@ -1,5 +1,5 @@
<a href="https://stringi.gagolewski.com/"><img src="https://www.gagolewski.com/_static/img/stringi.png" align="right" height="128" width="128" /></a>
# [**stringi**](https://stringi.gagolewski.com/)
# [**`stringi`**](https://stringi.gagolewski.com/)

### Fast and Portable Character String Processing in R (with the Unicode ICU)

Expand All @@ -12,14 +12,14 @@
> A comprehensive tutorial and reference manual is available
> at <https://stringi.gagolewski.com/>.
>
> Check out [**stringx**](https://stringx.gagolewski.com/) for a set of wrappers
> around **stringi** with a base R-compatible API.
> Check out [**`stringx`**](https://stringx.gagolewski.com/) for a set of wrappers
> around **`stringi`** with a base R-compatible API.
>
> To learn more about R, check out Marek's open-access (free!) textbook
> [Deep R Programming](https://deepr.gagolewski.com/).
**stringi** (pronounced “stringy”, IPA [strinɡi]) is THE *R*
package for string/text/natural language processing.
**`stringi`** (pronounced “stringy”, IPA [strinɡi])
is THE *R* package for string/text/natural language processing.
It is very fast, consistent, convenient, and — thanks to the
[ICU – International Components for Unicode](https://icu.unicode.org/)
library — portable across all locales and platforms.
Expand Down Expand Up @@ -47,14 +47,14 @@ with contributions from Bartłomiej Tartanus and many others.

The package's API was inspired by that of the early (pre-tidyverse; v0.6.2)
version of Hadley Wickham's
[*stringr*](https://cran.r-project.org/web/packages/stringr/)
package (and since the 2015 v1.0.0 *stringr* is powered by *stringi*).
[**`stringr`**](https://cran.r-project.org/web/packages/stringr/)
package (and since the 2015 v1.0.0 **`stringr`** is powered by **`stringi`**).


**Homepage**: https://stringi.gagolewski.com/

**Citation**: Gagolewski M.,
*stringi*: Fast and portable character string processing in R,
**`stringi`**: Fast and portable character string processing in R,
*Journal of Statistical Software* **103**(2), 2022, 1–59,
<https://dx.doi.org/10.18637/jss.v103.i02>.

Expand All @@ -64,7 +64,7 @@ package (and since the 2015 v1.0.0 *stringr* is powered by *stringi*).
[INSTALL](https://raw.githubusercontent.com/gagolews/stringi/master/INSTALL)
file for more details)

**License**: *stringi*'s source code is distributed under the open source
**License**: **`stringi`**'s source code is distributed under the open source
BSD-3-clause license. For more details, see
[LICENSE](https://raw.githubusercontent.com/gagolews/stringi/master/LICENSE).

Expand Down
2 changes: 1 addition & 1 deletion docs/genindex.html
Original file line number Diff line number Diff line change
Expand Up @@ -357,7 +357,7 @@ <h1 id="index">Index</h1>
Some rights reserved. Licensed under <a href='https://creativecommons.org/licenses/by-nc-nd/4.0/'>CC BY-NC-ND 4.0</a>.
Built with <a href="https://sphinx-doc.org/">Sphinx</a>
and a customised <a href="https://github.com/pradyunsg/furo">Furo</a> theme.
Last updated on 2023-11-09T12:07:05+1100.
Last updated on 2023-11-14T08:45:22+1100.
This site will never display any ads: it is a non-profit project.
It does not collect any data.
</div>
Expand Down
2 changes: 1 addition & 1 deletion docs/index.html
Original file line number Diff line number Diff line change
Expand Up @@ -448,7 +448,7 @@ <h1>stringi: Fast and Portable Character String Processing in R<a class="headerl
Some rights reserved. Licensed under <a href='https://creativecommons.org/licenses/by-nc-nd/4.0/'>CC BY-NC-ND 4.0</a>.
Built with <a href="https://sphinx-doc.org/">Sphinx</a>
and a customised <a href="https://github.com/pradyunsg/furo">Furo</a> theme.
Last updated on 2023-11-09T12:07:05+1100.
Last updated on 2023-11-14T08:45:22+1100.
This site will never display any ads: it is a non-profit project.
It does not collect any data.
</div>
Expand Down
2 changes: 1 addition & 1 deletion docs/install.html
Original file line number Diff line number Diff line change
Expand Up @@ -470,7 +470,7 @@ <h2>Getting Help<a class="headerlink" href="#getting-help" title="Link to this h
Some rights reserved. Licensed under <a href='https://creativecommons.org/licenses/by-nc-nd/4.0/'>CC BY-NC-ND 4.0</a>.
Built with <a href="https://sphinx-doc.org/">Sphinx</a>
and a customised <a href="https://github.com/pradyunsg/furo">Furo</a> theme.
Last updated on 2023-11-09T12:07:05+1100.
Last updated on 2023-11-14T08:45:22+1100.
This site will never display any ads: it is a non-profit project.
It does not collect any data.
</div>
Expand Down
18 changes: 9 additions & 9 deletions docs/news.html
Original file line number Diff line number Diff line change
Expand Up @@ -360,10 +360,10 @@ <h2>1.8.1 (2023-11-09)<a class="headerlink" href="#id1" title="Link to this head
<li><p>[BACKWARD INCOMPATIBILITY] Removed the long-deprecated and defunct
<code class="docutils literal notranslate"><span class="pre">fallback_encoding</span></code> parameter of <code class="docutils literal notranslate"><span class="pre">stri_read_lines</span></code> and the ellipsis
parameter of <code class="docutils literal notranslate"><span class="pre">stri_opts_collator</span></code>, <code class="docutils literal notranslate"><span class="pre">stri_opts_regex</span></code>, <code class="docutils literal notranslate"><span class="pre">stri_opts_fixed</span></code>,
and <code class="docutils literal notranslate"><span class="pre">stri_opts_regex</span></code>.</p></li>
<code class="docutils literal notranslate"><span class="pre">stri_opts_brkiter</span></code>, and <code class="docutils literal notranslate"><span class="pre">stri_opts_regex</span></code>.</p></li>
<li><p>[BUILD TIME] As per the suggestion of Prof. Brian Ripley, <code class="docutils literal notranslate"><span class="pre">icudt74l</span></code>
(ICU data - little endian) is now included in the source tarball (compressed
with xz to save space). This allows for building <em>stringi</em> on systems with
with xz to save space). This allows for building <strong><code class="docutils literal notranslate"><span class="pre">stringi</span></code></strong> on systems with
no internet access.</p></li>
<li><p>[NEW FEATURE] #476: In break iterator-, date-time-, and collator-based
operations (e.g., <code class="docutils literal notranslate"><span class="pre">stri_sort</span></code>), a warning is emitted when the <em>root</em> ICU
Expand Down Expand Up @@ -393,7 +393,7 @@ <h2>1.7.12 (2023-01-09)<a class="headerlink" href="#id2" title="Link to this hea
<section id="id3">
<h2>1.7.8 (2022-07-11)<a class="headerlink" href="#id3" title="Link to this heading">#</a></h2>
<ul class="simple">
<li><p>[DOCUMENTATION] Paper on <em>stringi</em> has been published in
<li><p>[DOCUMENTATION] Paper on <strong><code class="docutils literal notranslate"><span class="pre">stringi</span></code></strong> has been published in
the <em>Journal of Statistical Software</em>;
see <a class="reference external" href="https://doi.org/10.18637/jss.v103.i02">https://doi.org/10.18637/jss.v103.i02</a>.</p></li>
<li><p>[BUGFIX] #473, #397: Fixed buffer overflow in <code class="docutils literal notranslate"><span class="pre">stri_dup</span></code>; Also,
Expand All @@ -415,11 +415,11 @@ <h2>1.7.6 (2021-11-29)<a class="headerlink" href="#id4" title="Link to this head
<section id="id5">
<h2>1.7.5 (2021-10-04)<a class="headerlink" href="#id5" title="Link to this heading">#</a></h2>
<ul class="simple">
<li><p>[DOCUMENTATION] Paper on <em>stringi</em> has been accepted for
<li><p>[DOCUMENTATION] Paper on <strong><code class="docutils literal notranslate"><span class="pre">stringi</span></code></strong> has been accepted for
publication in the <em>Journal of Statistical Software</em>,
see <a class="reference external" href="https://stringi.gagolewski.com/_static/vignette/stringi.pdf">https://stringi.gagolewski.com/_static/vignette/stringi.pdf</a>
for a draft version.</p></li>
<li><p>[DOCUMENTATION] The <em>stringi</em> website at <a class="reference external" href="https://stringi.gagolewski.com/">https://stringi.gagolewski.com/</a>
<li><p>[DOCUMENTATION] The <strong><code class="docutils literal notranslate"><span class="pre">stringi</span></code></strong> website at <a class="reference external" href="https://stringi.gagolewski.com/">https://stringi.gagolewski.com/</a>
now features a comprehensive tutorial based on the aforementioned paper.</p></li>
<li><p>[DOCUMENTATION] The <em>ICU</em> Project site has been moved to
<a class="reference external" href="https://icu.unicode.org/">https://icu.unicode.org/</a>.</p></li>
Expand Down Expand Up @@ -536,8 +536,8 @@ <h2>1.6.1 (2021-05-05)<a class="headerlink" href="#id11" title="Link to this hea
not equipped with system ICU. However, if the C++11 support is disabled,
stringi will be built against the battle-tested ICU4C 55.1.
The update to ICU brings Unicode 13.0 and CLDR 39 support.</p></li>
<li><p>[DOCUMENTATION] A draft version of a paper on <code class="docutils literal notranslate"><span class="pre">stringi</span></code> is now available at
<a class="reference external" href="https://stringi.gagolewski.com/_static/vignette/stringi.pdf">https://stringi.gagolewski.com/_static/vignette/stringi.pdf</a>.</p></li>
<li><p>[DOCUMENTATION] A draft version of a paper on <strong><code class="docutils literal notranslate"><span class="pre">stringi</span></code></strong> is now available
at <a class="reference external" href="https://stringi.gagolewski.com/_static/vignette/stringi.pdf">https://stringi.gagolewski.com/_static/vignette/stringi.pdf</a>.</p></li>
<li><p>[GENERAL] stringi now requires R &gt;= 3.1 (<code class="docutils literal notranslate"><span class="pre">CXX_STD</span></code> of <code class="docutils literal notranslate"><span class="pre">CXX11</span></code> or <code class="docutils literal notranslate"><span class="pre">CXX1X</span></code>).</p></li>
<li><p>[NEW FEATURE] #408: <code class="docutils literal notranslate"><span class="pre">stri_trans_casefold()</span></code> performs case folding;
this is different from case mapping, which is locale-dependent.
Expand Down Expand Up @@ -926,7 +926,7 @@ <h2>0.4-1 (2014-12-11)<a class="headerlink" href="#id30" title="Link to this hea
<li><p>[NEW FEATURE] #117: <code class="docutils literal notranslate"><span class="pre">stri_extract_all_*()</span></code>, <code class="docutils literal notranslate"><span class="pre">stri_locate_all_*()</span></code>,
<code class="docutils literal notranslate"><span class="pre">stri_match_all_*()</span></code> gained a new argument: <code class="docutils literal notranslate"><span class="pre">omit_no_match</span></code>.
Setting it to <code class="docutils literal notranslate"><span class="pre">TRUE</span></code> makes these functions compatible with their
<code class="docutils literal notranslate"><span class="pre">stringr</span></code> equivalents.</p></li>
<strong><code class="docutils literal notranslate"><span class="pre">stringr</span></code></strong> equivalents.</p></li>
<li><p>[NEW FEATURE] #118: <code class="docutils literal notranslate"><span class="pre">stri_wrap()</span></code> gained <code class="docutils literal notranslate"><span class="pre">indent</span></code>, <code class="docutils literal notranslate"><span class="pre">exdent</span></code>, <code class="docutils literal notranslate"><span class="pre">initial</span></code>,
and <code class="docutils literal notranslate"><span class="pre">prefix</span></code> arguments. Moreover, Knuth’s dynamic word wrapping algorithm
now assumes that the cost of printing the last line is zero, see #128.</p></li>
Expand Down Expand Up @@ -1218,7 +1218,7 @@ <h2>0.1-1 (2013-01-05)<a class="headerlink" href="#id44" title="Link to this hea
Some rights reserved. Licensed under <a href='https://creativecommons.org/licenses/by-nc-nd/4.0/'>CC BY-NC-ND 4.0</a>.
Built with <a href="https://sphinx-doc.org/">Sphinx</a>
and a customised <a href="https://github.com/pradyunsg/furo">Furo</a> theme.
Last updated on 2023-11-09T12:07:05+1100.
Last updated on 2023-11-14T08:45:22+1100.
This site will never display any ads: it is a non-profit project.
It does not collect any data.
</div>
Expand Down
Loading

0 comments on commit 0f1cf65

Please sign in to comment.