Text Cleaning Documentation

File

Line

  • A Measuring Rod to Test Text Books, and Reference Books in Schools, Colleges and Libraries:
  1. Removed \n and ## so that the text runs as one continuous, space-separated passage.
  • Through Some Eventful Years:
  1. Removed \n
  2. Removed ## p.
  3. Removed ## #s s s
  4. Removed #s s s
  5. Removed -a s s
  6. Removed ----------------—
  7. In progress: removing extra line breaks, stray # characters, and characters describing pictures/emblems
  8. Removed odd characters
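
The removals listed above could be scripted along these lines. This is a minimal sketch, not the procedure actually used: the function name `clean_text`, the `DEBRIS` list, and the rule that four or more consecutive dashes count as a divider are all assumptions made for illustration.

```python
import re

# Literal debris strings taken from the list above, longest first so that
# "## #s s s" is removed before its substring "#s s s".
DEBRIS = ["## #s s s", "## p.", "#s s s", "-a s s"]


def clean_text(raw: str) -> str:
    """Return raw text with scan debris removed and whitespace collapsed."""
    text = raw
    for junk in DEBRIS:
        text = text.replace(junk, " ")
    # Divider rules: runs of four or more hyphens, en dashes, or em dashes
    # (the threshold of four is an assumption).
    text = re.sub(r"[-\u2013\u2014]{4,}", " ", text)
    # Any remaining '#' characters.
    text = text.replace("#", " ")
    # Line breaks become spaces so the text runs continuously.
    text = text.replace("\n", " ")
    # Collapse the repeated spaces left behind by the removals.
    return re.sub(r"\s+", " ", text).strip()
```

Replacing each removed string with a space rather than an empty string keeps neighbouring words from fusing together; the final whitespace pass then tidies the result.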

Notes

  • Will need to do a close reading of both texts
  • Would like to tabulate words used in both documents (e.g. Negro, white, Confederate/Confederacy, grateful/ungrateful, South) to find narratives of "the happy slave" and other Lost Cause myths used in textbooks.
  • Found extra textbooks that can be added to website/work later.
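
The word tabulation mentioned in the notes might start from something like the sketch below. It assumes the cleaned texts are already loaded as strings; the names `TERMS`, `count_terms`, and `term_table` are hypothetical, and matching is on whole lowercase words only, so "South" does not count "Southern" and "grateful" does not count "ungrateful".

```python
import re
from collections import Counter

# Terms named in the notes, lowercased for matching.
TERMS = ["negro", "white", "confederate", "confederacy",
         "grateful", "ungrateful", "south"]


def count_terms(text: str, terms=TERMS) -> dict:
    """Count whole-word, case-insensitive occurrences of each term."""
    words = Counter(re.findall(r"[a-z]+", text.lower()))
    return {term: words[term] for term in terms}


def term_table(docs: dict, terms=TERMS) -> dict:
    """Build {term: {title: count}} for a {title: text} mapping."""
    per_doc = {title: count_terms(text, terms) for title, text in docs.items()}
    return {term: {title: per_doc[title][term] for title in docs}
            for term in terms}
```

Raw counts only show where to look; the close reading noted above is still needed to judge how each word is used in context.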