Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Query About Sentiment Analysis Capabilities for Jordanian Arabic Texts #149

Open
LamaAkmeel opened this issue Dec 14, 2024 · 1 comment
Open
Assignees
Labels

Comments

@LamaAkmeel
Copy link

Hello CAMeL Tools Team,
I am new to Python and NLP and am currently working on a research project involving sentiment analysis of interview texts in Arabic, specifically from the Jordanian dialect. I would like to use CAMeL Tools for this purpose and have some questions:

1- Since the Dialect Identification feature is not available on Windows, can I still effectively use the sentiment analysis tool for Jordanian Arabic texts without identifying the dialect first? How critical is dialect identification to the performance of sentiment analysis within CAMeL Tools?

2- Does CAMeL Tools support sentiment analysis for Jordanian Arabic? If so, are there any specific configurations or pre-processing steps recommended for handling this dialect?

3- As I am new to NLP, could you advise whether CAMeL Tools provides pre-trained models for sentiment analysis that are suitable for Jordanian Arabic, or would I need to train a model myself? If training is required, could you provide some guidance or resources to get started?

@owo
Copy link
Collaborator

owo commented Dec 15, 2024

Hi @LamaAkmeel,

1- Dialect ID is not required for sentiment analysis and should work on Windows.

2- Our sentiment analysis models are fine-tuned on a mixture of Modern Standard Arabic, Egyptian, and Levantine (Jordan, Lebanon, Palestine and Syria). While I don't know what the performance of our system would be on Jordanian specifically, you can read more about the data used and evaluation results in our paper.

3- Unfortunately, camel-tools doesn't expose a training pipeline at the moment but we do aim to add that functionality in upcoming releases. As for alternative systems, you could try asking on the SIGARAB mailing group. @nizarhabash1 @balhafni any ideas?

I hope that helps.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants