Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Enrich stats items [JIRA: RCS-217] #961

Closed
23 tasks
shino opened this issue Aug 28, 2014 · 12 comments
Closed
23 tasks

Enrich stats items [JIRA: RCS-217] #961

shino opened this issue Aug 28, 2014 · 12 comments

Comments

@shino
Copy link
Contributor

shino commented Aug 28, 2014

Update 2015-07-17

After #1165 #1180 #1189 , major stats items are implemented for 2.1.0.
Remaining items for future releases are as follows.

  • admin api: user
  • admin api: users
  • admin api: storage
  • admin api: access
  • Pool exhaution counter
  • Indicator for block retries
  • GC activity/elapsed time/gc count/error count
  • deleted object size and number per batch (indicator for disk space reclaim)
  • timeout, disconnect, overload, notfound counter
  • 'siblings' in manifest size indicators
  • bucket number per user indicator
  • notfound in delete_fsm
  • bucket list size in user record
  • manifest history count
  • multipart manifest part count
  • total thruput MB/sec in
  • total thruput MB/sec out
  • error count grouped by error code/reason (total, last minutes)
  • stale manifest usage ... how much to be scheduled_delete?
  • http status codes stats
  • ad-hoc stats triggered by operation command that copies all
    stats item, focusing on a single user
  • Good / better / not-bad stats for object get/put latency,
    because they depend on body size
  • Good / better / not-bad stats for storage calc mapred
    because it depends on number of objects

Original description

  • User related elapsed time (get, put)
  • Bucket related elaplsed time (get)
  • Add latency_min and latency_100
  • Recent history information for pools
  • Indicator for pool overflow, or pool history
  • Indicator for block retries
  • GC activity/elapsed time/gc count/error count
  • Storage calculation activity/elapsed time/error count
  • Access Usage flush count/ fail count
  • List objects activity/elapsed time/error count

Recommendations and ideas are welcome.

@shino shino changed the title Increment stats items Enrich stats items Aug 28, 2014
@shino shino added this to the 2.0.0 milestone Aug 28, 2014
@ksauzz
Copy link
Contributor

ksauzz commented Aug 28, 2014

My recommendations are :

  • GC activity/elapsed time/gc count/error count
  • Storage calculation activity/elapsed time/error count
  • Access Usage flush count

@shino
Copy link
Contributor Author

shino commented Aug 28, 2014

@ksauzz thanks, added to the list in the description.

@kuenishi
Copy link
Contributor

  • deleted object size and number per batch (indicator for disk space reclaim)
  • disconnect, overload, notfound counter
  • proxy_get counter
  • 'siblings' in manifest size indicators

@kuenishi
Copy link
Contributor

nice

  • bucket number per user indicator
  • notfound in delete_fsm

And I think we have to move onto Exometer like Riak 2.0.

@ksauzz
Copy link
Contributor

ksauzz commented Oct 18, 2014

  • bucket history size
  • manifest history size

@shino
Copy link
Contributor Author

shino commented Oct 29, 2014

total thruput

  • MB/sec in
  • MB/sec out

@ksauzz
Copy link
Contributor

ksauzz commented Nov 14, 2014

meter_rate always looks to indicate about zero for me. We should clarify how it's useful, or remove it.

@shino
Copy link
Contributor Author

shino commented Nov 15, 2014

  • error count grouped by error code/reason (total, last minutes)

@ksauzz
Copy link
Contributor

ksauzz commented Nov 21, 2014

I'm not sure if get/put object latency should be included in metrics. it looks to depend on object size.

@kuenishi kuenishi modified the milestones: 2.0.0, 2.1.0 Jan 15, 2015
@kuenishi kuenishi modified the milestones: 2.0.1, 2.1.0 Apr 3, 2015
@kuenishi kuenishi modified the milestones: 2.1.0, 2.2.0 Jun 24, 2015
@kuenishi
Copy link
Contributor

get/put latency has been included in older versions prior to 2.0. But it's for all uptime, not for last minute.

@Basho-JIRA Basho-JIRA changed the title Enrich stats items Enrich stats items [JIRA: RCS-217] Jun 24, 2015
@kuenishi
Copy link
Contributor

I believe we've covered necessary stats. Will be out in 2.1.

@Basho-JIRA
Copy link

Release note will be in RCS-11.

_[posted via JIRA by Kota Uenishi]_

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

4 participants