Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

pull changes to main to opj branch #55

Merged
merged 73 commits into from
Oct 21, 2024
Merged

pull changes to main to opj branch #55

merged 73 commits into from
Oct 21, 2024

Conversation

jmopjordsmoen
Copy link
Collaborator

No description provided.

dependabot bot and others added 30 commits November 15, 2023 10:55
Bumps [pyarrow](https://github.com/apache/arrow) from 13.0.0 to 14.0.1.
- [Commits](apache/arrow@go/v13.0.0...apache-arrow-14.0.1)

---
updated-dependencies:
- dependency-name: pyarrow
  dependency-type: indirect
...

Signed-off-by: dependabot[bot] <[email protected]>
Bumps [urllib3](https://github.com/urllib3/urllib3) from 2.0.5 to 2.0.7.
- [Release notes](https://github.com/urllib3/urllib3/releases)
- [Changelog](https://github.com/urllib3/urllib3/blob/main/CHANGES.rst)
- [Commits](urllib3/urllib3@v2.0.5...2.0.7)

---
updated-dependencies:
- dependency-name: urllib3
  dependency-type: indirect
...

Signed-off-by: dependabot[bot] <[email protected]>
Bumps [jinja2](https://github.com/pallets/jinja) from 3.1.2 to 3.1.3.
- [Release notes](https://github.com/pallets/jinja/releases)
- [Changelog](https://github.com/pallets/jinja/blob/main/CHANGES.rst)
- [Commits](pallets/jinja@3.1.2...3.1.3)

---
updated-dependencies:
- dependency-name: jinja2
  dependency-type: indirect
...

Signed-off-by: dependabot[bot] <[email protected]>
Bumps [pillow](https://github.com/python-pillow/Pillow) from 10.1.0 to 10.2.0.
- [Release notes](https://github.com/python-pillow/Pillow/releases)
- [Changelog](https://github.com/python-pillow/Pillow/blob/main/CHANGES.rst)
- [Commits](python-pillow/Pillow@10.1.0...10.2.0)

---
updated-dependencies:
- dependency-name: pillow
  dependency-type: indirect
...

Signed-off-by: dependabot[bot] <[email protected]>
Bumps [aiohttp](https://github.com/aio-libs/aiohttp) from 3.8.5 to 3.9.2.
- [Release notes](https://github.com/aio-libs/aiohttp/releases)
- [Changelog](https://github.com/aio-libs/aiohttp/blob/master/CHANGES.rst)
- [Commits](aio-libs/aiohttp@v3.8.5...v3.9.2)

---
updated-dependencies:
- dependency-name: aiohttp
  dependency-type: indirect
...

Signed-off-by: dependabot[bot] <[email protected]>
Analysis of trends in a specific type of negative feedback from responders to the nøku oki-skjema
Tested the NNI method used for the production of nøku statistics. Test provided for an alternative method, but is in the prelimanary stages. Will need some improvement but the proof of concept is there.
Clean up the code a little bit.
1. Added ml evaluations on a 3-siffer level
2. fixed a bug where commas were causing columns to be classifed as strings
3. added a few extra tests to Master
jep739 and others added 19 commits October 3, 2024 15:49
…b-4.2.5

Bump jupyterlab from 4.2.4 to 4.2.5
…phy-43.0.1

Bump cryptography from 43.0.0 to 43.0.1
New model made in order to eliminate data leakage. New distance function added to model that makes it impossible for a neighbor to be from 'the future'

The new function does not use year in any other way - so perhaps its worth investigating using year as a feature variable?
@jmopjordsmoen jmopjordsmoen requested a review from jep739 October 11, 2024 09:43
jep739 and others added 4 commits October 11, 2024 11:54
3 different versions of the KNN model, including that original that has data leakage. More updates to come.
Little updates that allow you to switch on/off options such as whether to rerun ml models, or include UU data.
Moved data to the new google cloud bucket:

ssb-strukt-naering-data-produkt-prod

updated all the file paths in the functions and notebooks.
jep739 and others added 4 commits October 15, 2024 15:00
New knn model with custom distance function takes at least 24 hours to run. This is too long, although maybe worth doing once during the year .

To combat this i updated the original KNN model which had data leakage and also adjusted the training data to include some data from outside the sample. Results look good..... too good.

Adjusted method for reading in parquet files. Doesnt seem to work in the new bucket for some reason.
Imputed data now gets saved to google cloud every time its run, so now we can skip the machine learning part of the program.
@jmopjordsmoen jmopjordsmoen merged commit a16a48a into opj Oct 21, 2024
3 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants