Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

DataFrame.Join hangs indefinitely with small dataset #7407

Open
zgx opened this issue Mar 3, 2025 · 0 comments
Open

DataFrame.Join hangs indefinitely with small dataset #7407

zgx opened this issue Mar 3, 2025 · 0 comments
Labels
untriaged New issue has not been triaged

Comments

@zgx
Copy link

zgx commented Mar 3, 2025

Description:

When using Microsoft.Data.Analysis DataFrame's Join method, the operation hangs indefinitely even with a very small dataset. This happens with a simple inner join operation.

Reproduction steps:

  1. Create two small DataFrames:
var df1 = new DataFrame(
    new StringDataFrameColumn("Symbol", new[] { "MSFT", "AAPL", "GOOGL" }),
    new PrimitiveDataFrameColumn<double>("Price", new[] { 100.5, 150.2, 200.8 })
);

var df2 = new DataFrame(
    new StringDataFrameColumn("Symbol", new[] { "MSFT", "AAPL", "META" }),
    new PrimitiveDataFrameColumn<long>("Volume", new[] { 1000L, 1500L, 800L })
);
  1. Attempt to perform an inner join:
var joinedDf = df1.Join(df2, "Symbol", "Symbol", JoinAlgorithm.Inner);

Expected behavior:
The join operation should complete quickly and return a new DataFrame containing the matched rows.

Actual behavior:
The Join operation hangs indefinitely without returning or throwing an exception.

Environment:

  • OS: Windows 10
  • .NET Version: 8.0.100
  • Microsoft.Data.Analysis Version: 0.22.2

Additional notes:

  • The dataset is very small (only 3 rows in each DataFrame)
  • Both DataFrames have a common key column ("Symbol") with matching values
  • Memory usage appears normal during the hang
@dotnet-policy-service dotnet-policy-service bot added the untriaged New issue has not been triaged label Mar 3, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
untriaged New issue has not been triaged
Projects
None yet
Development

No branches or pull requests

1 participant