-
Notifications
You must be signed in to change notification settings - Fork 4
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Clickhouse tables #24
Comments
@dlomax64 could you remind what they should contain (i.e. schema)? |
I think that the original b2cPtaPkgR_all table will work for what I want to do with it:
If I remember correctly, Chris was wanting to be able to get downstream commits from a dependency so this table should be fine for that + some things I want to add for the website. |
TY. Will let you know once done. My plan is to have
|
Does the language extension and ctags for one commit often differ? |
In my experience with the For records where both language fields exist, they match a strong majority of the time. Ctags also can become an issue when checking their value in scripts since they are not character-safe. For example, C# (the ctag for the C# language) may be a problem. The WoC equivelant is simply 'Cs', a safer string. |
@audrism Can this table, or some other lookup program, be optimized for Currently, searching for all downstream commits for a pkg requires traversing all lines of each |
|
I included ctags classification mostly as a way to see where language types do not match, it may make sense to use linguist tool to identify language in such cases |
I do not imagine the "original" What would be very useful is a similar table that uses Project as the key, such as |
Apart from the commit detail (and project) table in the CH noted above, it is not entirely clear what to add a) One idea is to have monthly commit/blob counts (see issue #25) |
@audrism Can you update the Clickhouse Tables?
The text was updated successfully, but these errors were encountered: