Transforms every document in the corpus by removing blank lines, converting each whitespace character into a single space, and reducing runs of multiple spaces to a single space.
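As a rough illustration of the transformation described above, the following Python sketch normalizes the whitespace of a single document. It is not the script from the companion repository: the function name `normalize_whitespace` is hypothetical, and the sketch assumes that line breaks between non-blank lines are kept while whitespace within each line is collapsed.

```python
import re


def normalize_whitespace(text: str) -> str:
    """Hypothetical helper: sketch of the white-space normalization step."""
    # Remove blank lines, i.e. lines that are empty or contain only whitespace.
    lines = [line for line in text.splitlines() if line.strip()]
    # Within each remaining line, replace every run of whitespace
    # (tabs, repeated spaces, etc.) with a single space.
    # Assumption: line breaks between non-blank lines are preserved.
    return "\n".join(re.sub(r"\s+", " ", line) for line in lines)


if __name__ == "__main__":
    sample = "a\t\tb\n\n  \nc   d\n"
    print(normalize_whitespace(sample))
```

If the corpus should instead be flattened onto a single line, joining the result of `text.split()` with single spaces would be a simpler alternative.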
The script is run in accordance with the general form
Below is the suggested text to add to the Methods and Materials section of your paper when using this building block. The references can be found here
Each document in the corpus underwent several pre-processing steps. They were transformed under the following conditions:
- White-space normalization
The results of this transformation are illustrated in (figure xxx). The transformation script can be found in the companion repository ^[http://www.github.com/{user}/{repo}].