Feat: General File Reader Tool #398

tybalex · 2025-02-03T12:03:36Z

This PR adds a general File-Reader Tool by reusing existing knowledge ingestion code.
This tool can parse the content of input workspace file and convert to markdown format and write to a workspace file.

This tool also helps address issues like obot-platform/obot#1405, by:

the agent uses File Reader tool to convert .pdf file to .md file
the agent then uses the Summarizer tool to summarize the markdown format content.

@cjellick @thedadams I wonder if gptscript could support a syntax such that File Reader Tool is always a prerequisite of Summarizer Tool.

cjellick · 2025-02-04T15:20:11Z

Hm. This isn't really a file reader tool. It's a "Convert to markdown" tool. I'm not sure that's what we want.

I had envisioned a tool that reads a file and sends the contents directly to the LLM. Do you not think that approach will work well?

What happens if the user just says "Read the court filings file" or "What's the highest grossing office in my spreadsheet?" Will this tool be called?

I'm also not sure that we want to create double the artifacts in the workspace.

tybalex · 2025-02-04T15:33:17Z

Hm. This isn't really a file reader tool. It's a "Convert to markdown" tool. I'm not sure that's what we want.

Technically this tool parses the text content from pdf/pptx/docx documents, so it is a reader. In terms of the format, maybe not necessarily markdown, parsing the content to plain text would be enough. I guess markdown format is more readable?

I had envisioned a tool that reads a file and sends the contents directly to the LLM. Do you not think that approach will work well?

We can support both, and make it sends content back to llm by default. In the case of when the file has too much text, then it should probably write to a file and use a summarizer tool to handle the text.

What happens if the user just says "Read the court filings file" or "What's the highest grossing office in my spreadsheet?" Will this tool be called?

For structured data like a spreadsheet(I mean excel/csv/json .. etc), we will need to handle it separately.

tybalex · 2025-02-05T11:01:36Z

@cjellick I made an update, now this tool will send the content directly to LLM by default. If user specifies an output file, then it will do so.

tybalex added 2 commits February 3, 2025 19:13

file reader tool

d6bea2f

update description

f7aaa72

tybalex self-assigned this Feb 3, 2025

tybalex requested review from cjellick, thedadams, njhale, StrongMonkey and iwilltry42 February 3, 2025 12:16

iwilltry42 approved these changes Feb 3, 2025

View reviewed changes

update tool description

e22bd4a

by default the tool will read content and send directly to LLM.

a12aaac

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feat: General File Reader Tool #398

Feat: General File Reader Tool #398

tybalex commented Feb 3, 2025 •

edited

Loading

cjellick commented Feb 4, 2025

tybalex commented Feb 4, 2025 •

edited

Loading

tybalex commented Feb 5, 2025 •

edited

Loading

Feat: General File Reader Tool #398

Are you sure you want to change the base?

Feat: General File Reader Tool #398

Conversation

tybalex commented Feb 3, 2025 • edited Loading

cjellick commented Feb 4, 2025

tybalex commented Feb 4, 2025 • edited Loading

tybalex commented Feb 5, 2025 • edited Loading

tybalex commented Feb 3, 2025 •

edited

Loading

tybalex commented Feb 4, 2025 •

edited

Loading

tybalex commented Feb 5, 2025 •

edited

Loading