put your analytics key inside of ~/.bash_profile
for analytics on the entire database, run the motif-analytics-2.py program.
update: 03/14/2018: Now support the ability to query a single user's actions between two timestamps.
To do so, I have added a new input parameter which works as follows: Example command line input:
python motif-analytics-2.py user-dump user-id (this input gives you the data dump for all of the user's actions throughout the entire lifetime of our mongoDB collection)
python motif-analytics-2.py user-dump user-id 2018-01-01 (this input gives you the data dump for all of the user's actions from the timestamp entered to the present moment)
python motif-analytics-2.py user-dump user-id 2018-01-01 2018-01-31 (this input gives you the data dump for all of the user's actions between the two timestamps entered)
all three of these queries will print the data to the console for each of the user actions, one per line. after all of the data is dumped, it will print an integer count of the total number of URLs visited and a ranked order of the URLs visited (ties are printed in an arbitrary order) Run-time for the user-dump query is faster than other queries because instead of storing the data in structures, it is printed to the console immediately (aside from storing the URLs in a dict to keep track of appearance counts).
there are six possible inputs to generate data across the entire cohort:
-
graph users-per-room uses matplotlib to create a histogram that accounts for the number of users in each room
-
graph domains-per-room uses matplotlib to create a histogram that accounts for the number of domains in each room (session)
-
graph time-per-room uses matplotlib to create a histogram that calculates for the minutes each room was opened
-
print domains-by-rank desired-length-of-list prints a list from 1 to desired-length-of-list where 1 is the most common domain
-
print users-by-rank-room desired-length-of-list prints a list from 1 to desired-length-of-list where 1 is the user that appears in the most rooms
-
print users-by-rank-time desired-length-of-list prints a list from 1 to desired-length-of-list where 1 is the users who has spent the most total minutes using Motif
example of commandline if you want a graph to be generated from the data: python motif-analytics-2.py graph users-per-room
example of commandline if you want a ranked list of length 10 from the data : python motif-analytics-2.py print domains-by-rank 10
there are also four "user-query" options: total-time, room-count, room-ids, average-time
an example of running a specific user query is as follows:
python motif-analytics-2.py user-query total-time 58fa64cfe0d3ee1097943664
total-time: adds the entire length of all the rooms that the user has added-sessions to room-count: counts the number of rooms that the user has added a session to room-ids: all the rooms that a user has added-sessions to average-time: total-time/room-count