[wip] [serverless] Add S3 span pointers #3083
Draft
nhulston wants to merge 65 commits into main from nicholas.hulston/s3-span-pointers
65 commits
c6bf758
create skeleton for S3 span pointer creation
nhulston ce4f56a
get key+bucket from S3 request and etag from S3 response
nhulston 963dadd
calculate hash following span pointer rules
nhulston c5f0539
add span pointer to links and update `_dd.span_links` tag
nhulston 6a2bb47
add copyright to span_pointers.go
nhulston 98a1df8
add tests for `generatePointerHash`
nhulston 32f31c7
simplify param checks
nhulston eb16fdd
finish aws span in deserialize, not in init.
nhulston ded7e8e
implement `SpanContextWithLinks` in `mockspancontext`
nhulston 02da8e3
add TestHandleS3Operation
nhulston 9790821
f
nhulston acb79a8
impl noop
nhulston 1e8d8c1
temp impl civisibility
nhulston 7fcb8d6
test
nhulston 3020a61
test
nhulston c9163ef
test
nhulston ec580e1
test
nhulston 2d77b2a
test
nhulston 0f14d96
test
nhulston ec57e5c
test
nhulston 7031c27
test
nhulston d9ae023
test
nhulston 6656be0
test
nhulston 397aaf9
test
nhulston 7aa5eef
test
nhulston c228cc4
test
nhulston e9019bc
test
nhulston 61dff10
test
nhulston 2efd7ec
test
nhulston 451c77a
test
nhulston 66c37dd
test
nhulston c6b7011
test
nhulston 097bbd9
test
nhulston ad9d8c6
test
nhulston 1d5a2ac
test
nhulston 4b23bd9
test
nhulston 14e74b5
test
nhulston 3b048bd
test
nhulston b510644
test
nhulston 2963c53
test
nhulston 7e8fb2b
test
nhulston a325cfb
implement mockspan
nhulston 42dd6bd
serialize span links in meta
nhulston 67d6787
test remove omitempty
nhulston 1783913
test custom marshal
nhulston 0c38ae0
test custom marshal
nhulston f96a6ed
test
nhulston db5a438
test
nhulston 5e0f58b
test
nhulston 33e436a
test
nhulston e83b9a4
test
nhulston 25a616a
test
nhulston fc61d96
test
nhulston 25a4995
test
nhulston b6e1e67
test
nhulston 507e4bf
test
nhulston 45489d8
test
nhulston 20228af
test
nhulston aa572e3
test
nhulston 083c8b2
test
nhulston e148a0e
test
nhulston ee85da4
test
nhulston e6de76f
test
nhulston 526e65e
test
nhulston 7d4ae01
test
nhulston
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,91 @@ | ||
// Unless explicitly stated otherwise all files in this repository are licensed | ||
// under the Apache License Version 2.0. | ||
// This product includes software developed at Datadog (https://www.datadoghq.com/). | ||
// Copyright 2016 Datadog, Inc. | ||
|
||
package span_pointers | ||
|
||
import ( | ||
"crypto/sha256" | ||
"encoding/hex" | ||
"fmt" | ||
"github.com/aws/smithy-go/middleware" | ||
smithyhttp "github.com/aws/smithy-go/transport/http" | ||
"gopkg.in/DataDog/dd-trace-go.v1/ddtrace" | ||
"gopkg.in/DataDog/dd-trace-go.v1/ddtrace/tracer" | ||
"gopkg.in/DataDog/dd-trace-go.v1/internal/log" | ||
"strings" | ||
) | ||
|
||
const ( | ||
// SpanPointerHashLengthBytes 16 bytes = 32 chars. | ||
// See https://github.com/DataDog/dd-span-pointer-rules/blob/main/README.md#general-hashing-rules | ||
SpanPointerHashLengthBytes = 16 | ||
PointerDownDirection = "d" | ||
LinkKind = "span-pointer" | ||
S3PointerKind = "aws.s3.object" | ||
) | ||
|
||
func HandleS3Operation(in middleware.DeserializeInput, out middleware.DeserializeOutput, span tracer.Span) { | ||
req, ok := in.Request.(*smithyhttp.Request) | ||
if !ok { | ||
return | ||
} | ||
res, ok := out.RawResponse.(*smithyhttp.Response) | ||
if !ok { | ||
return | ||
} | ||
|
||
// URL format: https://BUCKETNAME.s3.REGION.amazonaws.com/KEYNAME?x-id=OPERATIONNAME | ||
key := strings.TrimPrefix(req.URL.Path, "/") | ||
bucket := strings.Split(req.URL.Host, ".")[0] | ||
// the AWS SDK sometimes wraps the eTag in quotes | ||
etag := strings.Trim(res.Header.Get("ETag"), "\"") | ||
if key == "" || bucket == "" || etag == "" { | ||
log.Debug("Unable to create S3 span pointer because the key, bucket, or ETag could not be found.") | ||
return | ||
} | ||
|
||
// Hash calculation rules: https://github.com/DataDog/dd-span-pointer-rules/blob/main/AWS/S3/Object/README.md | ||
components := []string{bucket, key, etag} | ||
hash := generatePointerHash(components) | ||
|
||
link := ddtrace.SpanLink{ | ||
// We leave trace_id, span_id, trace_id_high, tracestate, and flags as 0 or empty. | ||
// The Datadog frontend will use `ptr.hash` to find the linked span. | ||
TraceID: 0, | ||
SpanID: 0, | ||
TraceIDHigh: 0, | ||
Flags: 0, | ||
Tracestate: "", | ||
Attributes: map[string]string{ | ||
"ptr.kind": S3PointerKind, | ||
"ptr.dir": PointerDownDirection, | ||
"ptr.hash": hash, | ||
"link.kind": LinkKind, | ||
}, | ||
} | ||
fmt.Println("Adding link...") | ||
if spanWithLinks, ok := span.(tracer.SpanWithLinks); ok { | ||
fmt.Println("SpanWithLinks found") | ||
spanWithLinks.AddSpanLinks(link) | ||
} | ||
//span.AddSpanLinks(link) | ||
fmt.Println("Link added!") | ||
} | ||
|
||
// generatePointerHash generates a unique hash from an array of strings by joining them with | before hashing. | ||
// Used to uniquely identify AWS requests for span pointers. | ||
// Returns a 32-character hash uniquely identifying the components. | ||
func generatePointerHash(components []string) string { | ||
h := sha256.New() | ||
for i, component := range components { | ||
if i > 0 { | ||
h.Write([]byte("|")) | ||
} | ||
h.Write([]byte(component)) | ||
} | ||
|
||
fullHash := h.Sum(nil) | ||
return hex.EncodeToString(fullHash[:SpanPointerHashLengthBytes]) | ||
} |
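The hashing rule used by `generatePointerHash` can be tried in isolation. What follows is a minimal standalone sketch using only the standard library. `pointerHash` and `hashLengthBytes` are illustrative names, not identifiers from this PR, and the inputs are the bucket, key, and ETag from the "basic values" case in the PR's tests.

```go
package main

import (
	"crypto/sha256"
	"encoding/hex"
	"fmt"
	"strings"
)

// hashLengthBytes mirrors SpanPointerHashLengthBytes: 16 bytes encode to 32 hex characters.
// The name is illustrative and not taken from the PR.
const hashLengthBytes = 16

// pointerHash joins the components with "|", hashes the result with SHA-256,
// and returns the first 16 bytes as lowercase hex.
func pointerHash(components ...string) string {
	sum := sha256.Sum256([]byte(strings.Join(components, "|")))
	return hex.EncodeToString(sum[:hashLengthBytes])
}

func main() {
	// Bucket, key, and ETag from the "basic values" test case.
	h := pointerHash("some-bucket", "some-key.data", "ab12ef34")
	fmt.Println(len(h), h)
}
```

Joining first and hashing once should be equivalent to the incremental `h.Write` calls in the diff, since both feed the same byte sequence to SHA-256.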
185 changes: 185 additions & 0 deletions
contrib/aws/internal/span_pointers/span_pointers_test.go
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,185 @@ | ||
package span_pointers | ||
|
||
import ( | ||
"context" | ||
"encoding/json" | ||
"github.com/aws/smithy-go/middleware" | ||
smithyhttp "github.com/aws/smithy-go/transport/http" | ||
"github.com/stretchr/testify/assert" | ||
"github.com/stretchr/testify/require" | ||
"gopkg.in/DataDog/dd-trace-go.v1/ddtrace" | ||
"gopkg.in/DataDog/dd-trace-go.v1/ddtrace/mocktracer" | ||
"gopkg.in/DataDog/dd-trace-go.v1/ddtrace/tracer" | ||
"net/http" | ||
"net/url" | ||
"testing" | ||
) | ||
|
||
func TestGeneratePointerHash(t *testing.T) { | ||
tests := []struct { | ||
name string | ||
components []string | ||
expectedHash string | ||
}{ | ||
{ | ||
name: "basic values", | ||
components: []string{ | ||
"some-bucket", | ||
"some-key.data", | ||
"ab12ef34", | ||
}, | ||
expectedHash: "e721375466d4116ab551213fdea08413", | ||
}, | ||
{ | ||
name: "non-ascii key", | ||
components: []string{ | ||
"some-bucket", | ||
"some-key.你好", | ||
"ab12ef34", | ||
}, | ||
expectedHash: "d1333a04b9928ab462b5c6cadfa401f4", | ||
}, | ||
{ | ||
name: "multipart-upload", | ||
components: []string{ | ||
"some-bucket", | ||
"some-key.data", | ||
"ab12ef34-5", | ||
}, | ||
expectedHash: "2b90dffc37ebc7bc610152c3dc72af9f", | ||
}, | ||
} | ||
|
||
for _, tt := range tests { | ||
t.Run(tt.name, func(t *testing.T) { | ||
got := generatePointerHash(tt.components) | ||
if got != tt.expectedHash { | ||
t.Errorf("generatePointerHash() = %v, want %v", got, tt.expectedHash) | ||
} | ||
}) | ||
} | ||
} | ||
|
||
func TestHandleS3Operation(t *testing.T) { | ||
mt := mocktracer.Start() | ||
defer mt.Stop() | ||
|
||
tests := []struct { | ||
name string | ||
bucket string | ||
key string | ||
etag string | ||
expectedHash string | ||
expectSuccess bool | ||
}{ | ||
{ | ||
name: "basic operation", | ||
bucket: "some-bucket", | ||
key: "some-key.data", | ||
etag: "ab12ef34", | ||
expectedHash: "e721375466d4116ab551213fdea08413", | ||
expectSuccess: true, | ||
}, | ||
{ | ||
name: "quoted etag", | ||
bucket: "some-bucket", | ||
key: "some-key.data", | ||
etag: "\"ab12ef34\"", | ||
expectedHash: "e721375466d4116ab551213fdea08413", | ||
expectSuccess: true, | ||
}, | ||
{ | ||
name: "non-ascii key", | ||
bucket: "some-bucket", | ||
key: "some-key.你好", | ||
etag: "ab12ef34", | ||
expectedHash: "d1333a04b9928ab462b5c6cadfa401f4", | ||
expectSuccess: true, | ||
}, | ||
{ | ||
name: "empty bucket", | ||
bucket: "", | ||
key: "some_key", | ||
etag: "some_etag", | ||
expectSuccess: false, | ||
}, | ||
{ | ||
name: "empty key", | ||
bucket: "some_bucket", | ||
key: "", | ||
etag: "some_etag", | ||
expectSuccess: false, | ||
}, | ||
{ | ||
name: "empty etag", | ||
bucket: "some_bucket", | ||
key: "some_key", | ||
etag: "", | ||
expectSuccess: false, | ||
}, | ||
} | ||
|
||
for _, tt := range tests { | ||
t.Run(tt.name, func(t *testing.T) { | ||
ctx := context.Background() | ||
span, ctx := tracer.StartSpanFromContext(ctx, "test.s3.operation") | ||
|
||
// Create request | ||
reqURL, _ := url.Parse("https://" + tt.bucket + ".s3.region.amazonaws.com/" + tt.key) | ||
req := &smithyhttp.Request{ | ||
Request: &http.Request{ | ||
URL: reqURL, | ||
}, | ||
} | ||
|
||
// Create response | ||
header := http.Header{} | ||
header.Set("ETag", tt.etag) | ||
res := &smithyhttp.Response{ | ||
Response: &http.Response{ | ||
Header: header, | ||
}, | ||
} | ||
|
||
// Create input/output | ||
in := middleware.DeserializeInput{ | ||
Request: req, | ||
} | ||
out := middleware.DeserializeOutput{ | ||
RawResponse: res, | ||
} | ||
|
||
HandleS3Operation(in, out, span) | ||
span.Finish() | ||
spans := mt.FinishedSpans() | ||
if tt.expectSuccess { | ||
require.Len(t, spans, 1) | ||
tags := spans[0].Tags() | ||
|
||
spanLinks, exists := tags["_dd.span_links"] | ||
assert.True(t, exists, "Expected span links to be set") | ||
assert.NotEmpty(t, spanLinks, "Expected span links to not be empty") | ||
|
||
spanLinksStr, ok := spanLinks.(string) | ||
assert.True(t, ok, "Expected span links to be a string") | ||
|
||
var links []ddtrace.SpanLink | ||
err := json.Unmarshal([]byte(spanLinksStr), &links) | ||
require.NoError(t, err) | ||
require.Len(t, links, 1) | ||
|
||
attributes := links[0].Attributes | ||
assert.Equal(t, S3PointerKind, attributes["ptr.kind"]) | ||
assert.Equal(t, PointerDownDirection, attributes["ptr.dir"]) | ||
assert.Equal(t, LinkKind, attributes["link.kind"]) | ||
assert.Equal(t, tt.expectedHash, attributes["ptr.hash"]) | ||
} else { | ||
require.Len(t, spans, 1) | ||
tags := spans[0].Tags() | ||
_, exists := tags["_dd.span_links"] | ||
assert.False(t, exists, "Expected no span links to be set") | ||
} | ||
mt.Reset() | ||
}) | ||
} | ||
} |
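The request and response handling that `TestHandleS3Operation` exercises can be sketched without the AWS SDK. The example below assumes a virtual-hosted-style URL of the form noted in `HandleS3Operation`. `s3ObjectFromURL` and `normalizeETag` are hypothetical helper names, and path-style URLs are deliberately not handled.

```go
package main

import (
	"fmt"
	"net/url"
	"strings"
)

// s3ObjectFromURL extracts the bucket and key from a virtual-hosted-style S3 URL
// (https://BUCKETNAME.s3.REGION.amazonaws.com/KEYNAME).
// Assumption: path-style URLs are out of scope for this sketch.
func s3ObjectFromURL(rawURL string) (bucket, key string, err error) {
	u, err := url.Parse(rawURL)
	if err != nil {
		return "", "", err
	}
	bucket = strings.Split(u.Host, ".")[0] // first host label is the bucket
	key = strings.TrimPrefix(u.Path, "/")  // path without its leading slash is the key
	return bucket, key, nil
}

// normalizeETag strips double quotes that may surround the ETag header value.
func normalizeETag(etag string) string {
	return strings.Trim(etag, "\"")
}

func main() {
	bucket, key, err := s3ObjectFromURL("https://some-bucket.s3.region.amazonaws.com/some-key.data?x-id=PutObject")
	if err != nil {
		panic(err)
	}
	fmt.Println(bucket, key, normalizeETag(`"ab12ef34"`))
}
```

Quoted and unquoted ETags normalize to the same value, which is why the "basic operation" and "quoted etag" test cases expect the same hash.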
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,20 @@ | ||
package ddtrace | ||
|
||
// SpanLink represents a reference to a span that exists outside of the trace. | ||
// | ||
//go:generate msgp -unexported -marshal=false -o=span_link_msgp.go -tests=false | ||
|
||
type SpanLink struct { | ||
// TraceID represents the low 64 bits of the linked span's trace id. This field is required. | ||
TraceID uint64 `msg:"trace_id" json:"trace_id"` | ||
// TraceIDHigh represents the high 64 bits of the linked span's trace id. This field is only set if the linked span's trace id is 128 bits. | ||
TraceIDHigh uint64 `msg:"trace_id_high" json:"trace_id_high"` | ||
// SpanID represents the linked span's span id. | ||
SpanID uint64 `msg:"span_id" json:"span_id"` | ||
// Attributes is a mapping of keys to string values. These values are used to add additional context to the span link. | ||
Attributes map[string]string `msg:"attributes,omitempty" json:"attributes"` | ||
// Tracestate is the tracestate of the linked span. This field is optional. | ||
Tracestate string `msg:"tracestate" json:"tracestate"` | ||
// Flags represents the W3C trace flags of the linked span. This field is optional. | ||
Flags uint32 `msg:"flags" json:"flags"` | ||
} |
The AWS span should live until we receive a response, not end when the request is sent. (This is how AWS spans work in other tracers.)
From my testing, this has barely any actual impact on the span end time. (Does anyone know why? I'd expect a big difference.)
Therefore, I'm moving `span.Finish()` to `deserialize`. I also had to do this to get the unit tests to pass.