- 
                Notifications
    You must be signed in to change notification settings 
- Fork 216
add a multicodec to identify Ethereum Solidity ABI-encoded data #363
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: master
Are you sure you want to change the base?
Conversation
| @rvagg Given you've been dipping your toes in Ethereum standards as a member of FilOz, I think you'd see the value in this quite clearly ;-) | 
| eth-storage-trie, ipld, 0x98, permanent, Ethereum Contract Storage Trie (Eth-Secure-Trie) | ||
| eth-receipt-log-trie, ipld, 0x99, draft, Ethereum Transaction Receipt Log Trie (Eth-Trie) | ||
| eth-receipt-log, ipld, 0x9a, draft, Ethereum Transaction Receipt Log (RLP) | ||
| eth-solidity-abi, ipld, 0x9b, draft, Ethereum Solidity ABI-encoded data | 
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think something more concrete for the description might help, since it's currently just redundant to the name:
| eth-solidity-abi, ipld, 0x9b, draft, Ethereum Solidity ABI-encoded data | |
| eth-solidity-abi, ipld, 0x9b, draft, ABI-encoded contract calls from EVM environments | 
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
not just "calls", returns can also be encoded in this form too
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
none of this is correct.
(a) as mentioned above, Solidity ABI encoding is used for more than just calls.
(b) the EVM does not care about the ABI.
(c) it is Ethereum-related, so let's keep Ethereum in the description.
| Also, forgive me if I'm offbase here, but doesn't Vyper generate the same ABIs? I think it's probably more accurate to scope this to EVM than to Solidity specifically. | 
| A couple thoughts: 
 | 
| I can see historically how this might have been useful in early fevm days, perhaps to signal better than this: https://github.com/filecoin-project/lotus/blob/e2c46f1af1e088535d2bb5a2ec04362b9b0e5009/itests/kit/evm.go#L155-L162 on the way in and this: https://github.com/filecoin-project/lotus/blob/e2c46f1af1e088535d2bb5a2ec04362b9b0e5009/node/impl/full/eth.go#L1503-L1511 on the way out. (Although I can't quite see what it would buy us there.) The  @raulk do you have a use-case in mind where this kind of signalling is needed? It's in the two-byte range so pretty cheap to add and I don't think this is particularly controversial, unless it's just a thought-bubble entry without a concrete use-case ready to use it. | 
| Appreciate the quick feedback here! 
 @bumblefudge This ABI scheme was introduced by Solidity, and other high-level Ethereum programming languages followed in adoption in order to interoperate. Ethereum and the EVM are unopinionated at the transaction data layer. From I can tell, it is spec'ed and canonically maintained by the Solidity team. I suspect your question is aimed at clarifying if it's correct to term this "Solidity ABI". My reflection is "yes", as there is technically no standardised Ethereum ABI (nor I expect there to be one). 
 @rvagg @aschmahmann Our immediate use case is enabling FVM Wasm actors to expose Ethereum-compatible interfaces. More concretely, in IPC we have anchoring contracts written in Solidity deployed to the parent, and native system actors that run inside IPC subnets. Sometimes we need to make the latter return data that can opaquely be passed to the former as call arguments, without client re-encoding. The  On a more general level, if we ever want to use CID-based systems like IPFS/Filecoin/etc. to store, address, and handle Ethereum calldata, return data, and events (which arguably is already happening today anyway), it's worth being able to express the format type in CIDs to aid downstream interpretation (e.g. indexing, discovery, hashing). 
 @aschmahmann I don't know 🤷 I just copied from the row above. Is the schema for this CSV column defined somewhere? I'm fine with @rvagg's suggestion: 
 | 
| In terms of code allocations, IMO, allocating 2-3 byte codes for "maybes" is totally fine. In terms of serialization v. IPLD, "this is solidity ABI encoded" tells the user exactly nothing unless they also know the schema. It's definitely serialization, not IPLD, because it's not self describing. IMO, there's no harm in adding this (as long as the type is serialization). However, separate from the discussion of whether or not this codec should exist, I'm not sure how useful this codec is given that solidity ABI encoded data isn't self-describing. Either: 
 I'm curious to know how you plan on using it. | 
| @Stebalien the issue is that multicodecs get used for all kinds of things, at many levels of the stack, e.g. on a user-facing level we use them for content type signalling in CIDs; on a deep systems level we use them in the ref-fvm to convey the content type of parameters and return values (blocks) to (a) induce link scanning or not, (b) signal to higher APIs how to interpret the content. Given the generality and centrality of multicodec, I lean towards a policy evaluating proposals based on opportunities unlocked and directional vision, versus gatekeeping concrete immediate use cases. For example, if I ever wanted to build a multi-blockchain IPFS-based indexing system, down the the call semantics level, with the ability to address and interpret any blob (params, return values, errors, events, traces, etc.) through a CID, it would be very valuable to be able to express the blob's content type within its CID. Alternatively I could use a an application-level enveloping data structure together with a RAW codec or another standardised custom codec, and put the content type in the envelope, but that is rather wasteful, poorly interoperable, and somewhat of a workaround. In this particular case, we were creating a custom Wasm FVM actor that behaved externally like a Solidity smart contract, exposing a Solidity ABI interface. The  Re: Solidity ABI and self-describability, it could make sense to create a multicodec point for the Solidity 4byte call convention as well. You don't need to know the method ahead of time; it's very common to use directories like https://www.4byte.directory/ (for which public APIs and libraries exist) to parse the call and discover its schema at runtime. | 
| I'm interested to hear @Stebalien's response to that, but I'm fine with this PR since it's not in the double-byte range and is being requested for a concrete and plausible use-case. I'd just like the label column changed to  | 
See https://docs.soliditylang.org/en/latest/abi-spec.html. Solidity ABI encoding is the defacto standard by which call data, return data, events, and more are represented in Ethereum. So given its prevalence, it's worth introducing a dedicated multicodec instead of being relegated to using RAW.