This Python project is designed to analyze invoices by extracting information from PDF files, emails, and text files. It processes the data to calculate the total amount in euros.
- PDF to Text Conversion: Utilizes
Pdf2Txt.py
to convert PDF files to text format for easy analysis. - Email Parsing: Extracts relevant invoice data from emails using
ExtractMails.py
. - Text Analysis: Analyzes text files containing invoice information with
AnalyzeTxtFINAL.py
. - Total Calculation: Calculates the total amount in euros based on the extracted data.
-
Clone the Repository: Clone this repository to your local machine using:
git clone <repository_url>
-
Install Dependencies: Ensure you have Python installed on your system. Install the required dependencies by running:
pip install -r requirements.txt
-
Usage:
- Place your PDF files in the
Descargas
directory. - Run the relevant Python scripts to process the invoices and generate the total amount in euros.
- Place your PDF files in the
This is the Input:
The script 'Pdf2Txt.py' converts the pdf to text so Python can analyze it and manipulate it.
Output:
- DescargasMail: Directory where PDFs files are located.
- OutputTxtsV2: Output directory for converted text files.
- AnalyzeTxtFINAL.py: Python script to analyze text files and calculate totals.
- ExtractMails.py: Python script to extract invoice information from emails.
- Pdf2Txt.py: Python script to convert PDF files to text format.
- Some items needed manual modification so the total woks well.
- Review
OutputTxtsV2
for insights into the analyzed data and file descriptions.
This project is licensed under the MIT License.
Feel free to adjust the content and add more details as needed!