e_forward() not returning results without IDs named 'src' and 'dest' #614
Comments
@DataBoyTX Any way to know if they have additional surprisingly named columns? We don't assume hard-coded names, so the issue is generally excess results, not missing...
@lmeyerov - I am not sure what you mean by surprisingly named columns, the column names that don't work are:
Also curious what you mean by "we don't assume hard-coded names"?
elid is not a variable name in this repo, so I don't know why it would collide. I'm wondering if their data has more columns, and the bug report does not provide the full schema.
@lmeyerov - sorry, I missed this comment. I added the link to the dataset above a few weeks ago. elid is in both files and in the source code (renamed to src/dest for the edges to show that it works), so it's definitely there. data file (also in description above): link
This is fascinating. I think the actual issue occurs when the node ID column name matches one of the edge src/dst column names:
g = graphistry.nodes(df1, n).edges(df2, e1, e2)
assert g._node != g._source
assert g._node != g._destination
This is surprising behavior, so I'm digging in.
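The condition in the asserts above can be checked up front, before running a query. The following is a minimal sketch that assumes the three column names are available as plain strings; `find_id_collisions` is a hypothetical helper written for illustration and is not part of PyGraphistry.

```python
from typing import List, Optional


def find_id_collisions(
    node_col: Optional[str], src_col: str, dst_col: str
) -> List[str]:
    """Return the edge roles whose column name equals the node ID column name.

    Hypothetical helper for illustration only; not a PyGraphistry API.
    """
    if node_col is None:
        # No node ID column bound, so nothing can collide.
        return []
    collisions = []
    if node_col == src_col:
        collisions.append("source")
    if node_col == dst_col:
        collisions.append("destination")
    return collisions


# Illustrative names only: a node ID column that shares its name
# with the edge source column.
print(find_id_collisions("elid", "elid", "dest"))
```

With a bound graph, the same check could presumably be called as `find_id_collisions(g._node, g._source, g._destination)`, using the attributes referenced in the asserts above.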
Yep that's it:
This can likely be solved through renaming src/dst col names during
Longer-term, we should be using symbol tables for names to avoid this kind of issue.
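One way to picture the renaming idea is to generate internal column names that are guaranteed not to clash with any user column, in the spirit of a small symbol table. The sketch below is only an assumption about how such a scheme could look and is not the fix applied in PyGraphistry; `fresh_name` and `internal_edge_names` are hypothetical helpers, and the code assumes the source and destination columns have different names.

```python
from typing import Dict, Iterable


def fresh_name(base: str, taken: Iterable[str]) -> str:
    """Return a name derived from `base` that does not appear in `taken`."""
    taken_set = set(taken)
    candidate = f"__{base}__"
    suffix = 0
    while candidate in taken_set:
        suffix += 1
        candidate = f"__{base}_{suffix}__"
    return candidate


def internal_edge_names(
    node_cols: Iterable[str],
    edge_cols: Iterable[str],
    src_col: str,
    dst_col: str,
) -> Dict[str, str]:
    """Map user edge ID columns to collision-free internal names.

    Hypothetical helper; assumes src_col != dst_col.
    """
    taken = set(node_cols) | set(edge_cols)
    new_src = fresh_name("src", taken)
    taken.add(new_src)  # keep the second name distinct from the first
    new_dst = fresh_name("dst", taken)
    return {src_col: new_src, dst_col: new_dst}


# Illustrative column names only.
mapping = internal_edge_names(["elid", "name"], ["elid", "target"], "elid", "target")
print(mapping)
```

The resulting mapping could be applied to a pandas edge table with `DataFrame.rename(columns=mapping)` before the traversal, and inverted afterwards to restore the user's names.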
Describe the bug
A customer reported that GFQL's e_forward() does not return correct results unless the edge DataFrame ID columns are named 'src' and 'dest'.
To Reproduce
blueprint_nodes.zip
Expected behavior
e_forward() should return the matching edges.
Actual behavior
No edges are returned unless using e() or e_undirected(), or unless the ID columns are renamed to 'src' and 'dest'.
Screenshot showing different counts for ID columns with the same contents but different names:
Graphistry GPU server environment
Hub v2.41.10
PyGraphistry API client environment
Jupyter Lab local
0.34.17
Python 3.8.5