feat: Add vision support #4076

TheRamU · 2024-02-19T20:52:55Z

Gemini Pro Vision supported
GPT-4 Vision supported
Fixed Gemini request errors not being reported to the user
Image features are now globally supported in this application

vercel · 2024-02-19T20:53:00Z

@TheRamU is attempting to deploy a commit to the NextChat Team on Vercel.

A member of the Team first needs to authorize it.

github-actions · 2024-02-19T20:55:56Z

Your build has completed!

Preview deployment

H0llyW00dzZ · 2024-02-19T22:10:06Z

LGTM

H0llyW00dzZ

Request change: Summarize for gemini-pro and gemini-pro-vision

H0llyW00dzZ · 2024-02-19T22:42:57Z

app/store/chat.ts

+  if (currentModel.startsWith("gpt")) {
+    return SUMMARIZE_MODEL;
+  }
+  if (currentModel.startsWith("gemini-pro")) {


Tip
It's more effective to use Summarize with Gemini's own models. For example, gemini-pro refers to the Gemini Pro model, and gemini-pro-vision refers to the Gemini Pro Vision model. These models are cheaper than OpenAI's (hahaha) and more efficient, since each model has its own token limit. For instance, the input token limit for the gemini-pro-vision model is around 12288 tokens.

Proof:

TheRamU · Feb 21, 2024

Gemini Pro Vision does not support multi-turn chat, which limits its usefulness as a "Summarize Model".

H0llyW00dzZ · Feb 21, 2024

RIP. It's a better model, unlike OpenAI's gpt-4-vision-preview.
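
The diff hunk in this review thread is cut off at the `gemini-pro` branch, so the following is only a minimal sketch of how the summarize-model selection discussed here might look. The function name `getSummarizeModel`, the constant `GEMINI_SUMMARIZE_MODEL`, and both constant values are assumptions for illustration, not taken from the PR; only `SUMMARIZE_MODEL` and the two `startsWith` checks appear in the hunk.

```typescript
// Hypothetical values: the PR excerpt does not show what these constants hold.
const SUMMARIZE_MODEL = "gpt-3.5-turbo";
const GEMINI_SUMMARIZE_MODEL = "gemini-pro";

// Hypothetical helper name; picks the model used to summarize a conversation.
function getSummarizeModel(currentModel: string): string {
  // GPT-family chats are summarized with a fixed OpenAI model.
  if (currentModel.startsWith("gpt")) {
    return SUMMARIZE_MODEL;
  }
  // Gemini chats stay on a Gemini model. The text-only model is chosen here
  // because gemini-pro-vision is reported in this thread not to support
  // multi-turn chat (assumed behavior of this branch, not confirmed by the diff).
  if (currentModel.startsWith("gemini-pro")) {
    return GEMINI_SUMMARIZE_MODEL;
  }
  // Any other model summarizes with itself.
  return currentModel;
}
```

Under these assumptions, both `gemini-pro` and `gemini-pro-vision` chats would be summarized by `gemini-pro`, which keeps the summary on Google's side without sending chat history to the vision model.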

H0llyW00dzZ · 2024-02-20T00:47:43Z

Show off

* Add vision support (ChatGPTNextWeb#4076)

* Refactor [UI/UX] [Front End] [Chat] Remove Duplicate "onUserInput"
- [+] refactor(chat.tsx): remove duplicate onUserInput call and localStorage.setItem in _Chat function

* Feat [UI/UX] [Chat List] Search Support for Multimodal Content
- [+] feat(chat-list.tsx): add search support for array of MultimodalContent in ChatList component

* Style [UI/UX] [Chat List] Linting
- [+] style(chat-list.tsx): improve readability by breaking down lengthy if condition into multiple lines

* Adding Back Text Moderation
- [+] feat(openai.ts): add support for text moderation in chat method

* Feat [LLM APIs] [Google] InlineData
- [+] feat(google.ts): add InlineData to MessagePart, refactor message construction
- [+] chore(google.ts): add comments for clarity

* Style [LLM APIs] [Google] InlineData
- [+] style(google.ts): update comment for InlineData interface

* Todo [LLM APIs] [Google] InlineData
- [+] todo(google.ts): add TODO comment to improve safety settings configuration

* Todo [UI/UX] [Front End] [Chat] Summarize
- [+] chore(chat.ts): add TODO comment to improve the summary for gemini-pro-vision

---------

Co-authored-by: TheRamU <[email protected]>
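
The commit message above mentions adding `InlineData` to `MessagePart` in google.ts without showing the code. As a rough sketch of what that could involve, the snippet below turns a base64 data URL into an inline-data part next to a text part. The field names `inline_data`, `mime_type`, and `data` follow the Gemini REST request body as I understand it; the helper names `dataUrlToPart` and `buildParts` are hypothetical and not taken from the commit.

```typescript
// Interface names come from the commit message; the fields are assumed to
// mirror the Gemini REST request body.
interface InlineData {
  mime_type: string; // e.g. "image/png"
  data: string; // base64 payload without the "data:...;base64," prefix
}

interface MessagePart {
  text?: string;
  inline_data?: InlineData;
}

// Hypothetical helper: split a base64 data URL into MIME type and payload.
function dataUrlToPart(dataUrl: string): MessagePart {
  const match = /^data:([^;,]+);base64,(.+)$/.exec(dataUrl);
  const mimeType = match?.[1];
  const data = match?.[2];
  if (!mimeType || !data) {
    throw new Error("expected a base64 data URL");
  }
  return { inline_data: { mime_type: mimeType, data } };
}

// Hypothetical helper: one text part followed by one part per image.
function buildParts(text: string, imageDataUrls: string[]): MessagePart[] {
  return [{ text }, ...imageDataUrls.map(dataUrlToPart)];
}
```

For example, `buildParts("Describe this", ["data:image/png;base64,AAAA"])` would yield a text part followed by a part whose `inline_data` carries `image/png` and `AAAA`.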

Issues-translate-bot · 2024-02-27T02:57:02Z

Bot detected the issue body's language is not English, translate it automatically.

Is it possible to select the Google visual model so that the conversation does not use multiple rounds of chat, and only uploads single-discussion conversations, without needing to clear the context every time it is used?

fengzai6 · 2024-02-27T02:57:20Z

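When the Google vision model is selected, could the conversation skip multi-turn chat and upload only the single current message, so that the context does not need to be cleared before every use?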

Issues-translate-bot · 2024-02-27T02:57:29Z

Bot detected the issue body's language is not English, translate it automatically.

Can the conversation not use multiple rounds of chat when the Google visual model is selected, and only a single conversation can be uploaded without the need to clear the context each time it is used?
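
One way the behavior requested above could be approached is to trim the history to the latest user message whenever a single-turn model is selected. This is only a sketch under stated assumptions: `ChatMessage`, `isSingleTurnModel`, and `selectMessagesToSend` are hypothetical names, message content is simplified to a plain string, and nothing here is taken from the merged PR.

```typescript
// Simplified, hypothetical message shape for illustration.
interface ChatMessage {
  role: "user" | "assistant" | "system";
  content: string;
}

// Assumption: only gemini-pro-vision needs single-turn handling.
function isSingleTurnModel(model: string): boolean {
  return model === "gemini-pro-vision";
}

// Returns the messages to send: the full history for multi-turn models,
// or just the most recent user message for single-turn models.
function selectMessagesToSend(
  model: string,
  history: ChatMessage[],
): ChatMessage[] {
  if (!isSingleTurnModel(model)) {
    return history;
  }
  for (let i = history.length - 1; i >= 0; i--) {
    const message = history[i];
    if (message && message.role === "user") {
      return [message];
    }
  }
  return [];
}
```

With this approach the stored conversation stays intact, and only the request payload is narrowed, so the user would not have to clear the context manually.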

PengLingJun · 2024-03-06T04:34:47Z

Why are my responses incomplete when using the vision-preview API?

Add vision support

3291516

H0llyW00dzZ suggested changes Feb 19, 2024


fred-bf merged commit e2da340 into ChatGPTNextWeb:main Feb 20, 2024
1 of 2 checks passed

H0llyW00dzZ pushed a commit to H0llyW00dzZ/ChatGPT-Next-Web that referenced this pull request Feb 20, 2024

Add vision support (ChatGPTNextWeb#4076)

4424cac

X-Zero-L pushed a commit to X-Zero-L/ChatGPT-Next-Web that referenced this pull request Feb 21, 2024

Add vision support (ChatGPTNextWeb#4076)

006c5aa

gaogao1030 pushed a commit to gaogao1030/ChatGPT-Next-Web that referenced this pull request May 16, 2024

Add vision support (ChatGPTNextWeb#4076)

9e626be
