This program counts number of letters usage in the provided texts and makes charts.
Local data is taken from the Data directory and will be grouped by subfolders. Don't forget to set copy to output directory property to copy if newer.
You can create your input provider by implementing IInputTextStreamProvider interface.
Currently supported formats: Epub. To add a new format please extend FileText and FileTextFactory with your file format.
Charts are generated by ChartjsNodeCanvas invoked from .NET via Javascript.NodeJS.
- From the investigation, we can see that usage of the letter 'Ñ‘' has increased significantly. It's due to the fact that this letter was introduced in 1797 but its usage was marginal thus it was avoided.
- The letter 'Ñ„' is used more frequently in modern literature due to the increased number of foreign words.
- Modern literature is more concise than classic one. This we can see in letters count analysis results. All analyzed literature has the same form - novel.
Time spent on analysis of 20 large books is ~560ms.
Time spent on this data rendering is ~1500ms due to the fact that rendering code is executed via nodejs.