[Week 3] About gold data #63

Open
nowionlyseedaylight opened this issue Mar 20, 2022 · 0 comments

Comments

@nowionlyseedaylight
Collaborator

I looked further into the gold data that was mentioned during the explanation of cross entropy loss.

What does gold data mean?
Gold data is data of very high quality, about as close as you can get to the ground truth. For example, Alzheimer's disease can be diagnosed through behavioral tests, but that diagnosis is imperfect and can be confused with other types of dementia, so labels based on such tests alone would not qualify as gold data.

What is a gold standard in NLP?
In natural language processing (NLP) and computational linguistics, a gold standard is typically a corpus of text or a set of documents annotated or tagged with the desired results of the analysis, such as part-of-speech labels, syntactic parses, concepts, or relationships.
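To connect this back to cross entropy loss, here is a minimal sketch of how gold labels might enter the loss computation. The two-token sentence, its gold tags, and the model probabilities are all made up for illustration, and `cross_entropy` is a hypothetical helper, not a function from any particular library.

```python
import math


def cross_entropy(predicted_probs, gold_labels):
    """Average negative log-probability the model assigns to each gold label."""
    total = 0.0
    for probs, gold in zip(predicted_probs, gold_labels):
        # Only the probability of the gold (reference) label contributes.
        total += -math.log(probs[gold])
    return total / len(gold_labels)


# Hypothetical gold-standard part-of-speech tags for the tokens "dogs bark".
gold_tags = ["NOUN", "VERB"]

# Hypothetical model output: one probability distribution per token.
predicted = [
    {"NOUN": 0.9, "VERB": 0.1},
    {"NOUN": 0.5, "VERB": 0.5},
]

loss = cross_entropy(predicted, gold_tags)
print(f"cross entropy vs. gold tags: {loss:.4f}")
```

The loss shrinks as the model puts more probability on the gold tag of each token, which is why the quality of the gold annotation directly limits what the model can learn.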

What is the gold standard validation strategy in machine learning?
Generally, k-fold cross-validation is the gold standard for evaluating the performance of a machine learning algorithm on unseen data, with k set to 3, 5, or 10. A single train/test split is faster when the algorithm is slow, and it produces performance estimates with lower bias when the dataset is large. Note that "gold standard" here means the preferred evaluation method, a different sense from gold data as reference labels.
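As a rough sketch of what k-fold splitting does, the snippet below partitions sample indices into k contiguous folds and uses each fold once as the test set. `k_fold_indices` is a hypothetical helper written for this comment; it does not shuffle, which a real setup usually would, and in practice a library routine such as scikit-learn's `KFold` would normally be used instead.

```python
def k_fold_indices(n_samples, k):
    """Yield (train_indices, test_indices) for each of k contiguous folds."""
    if not 2 <= k <= n_samples:
        raise ValueError("k must be between 2 and n_samples")
    # Spread the remainder over the first folds so sizes differ by at most 1.
    fold_sizes = [n_samples // k + (1 if i < n_samples % k else 0) for i in range(k)]
    start = 0
    for size in fold_sizes:
        test = list(range(start, start + size))
        train = list(range(0, start)) + list(range(start + size, n_samples))
        yield train, test
        start += size


# Example: 10 samples, k = 5, so every sample is tested exactly once.
folds = list(k_fold_indices(10, 5))
for fold_number, (train, test) in enumerate(folds, start=1):
    print(f"fold {fold_number}: train={train} test={test}")
```

The model would be trained and scored once per fold, and the k scores averaged to estimate performance on unseen data.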
