You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.

445 lines
18 KiB

6 years ago
6 years ago
6 years ago
6 years ago
  1. # Applications
  2. Please ensure you've first read the spec for [ABCI Methods and Types](abci.md)
  3. Here we cover the following components of ABCI applications:
  4. - [Connection State](#state) - the interplay between ABCI connections and application state
  5. and the differences between `CheckTx` and `DeliverTx`.
  6. - [Transaction Results](#transaction-results) - rules around transaction
  7. results and validity
  8. - [Validator Set Updates](#validator-updates) - how validator sets are
  9. changed during `InitChain` and `EndBlock`
  10. - [Query](#query) - standards for using the `Query` method and proofs about the
  11. application state
  12. - [Crash Recovery](#crash-recovery) - handshake protocol to synchronize
  13. Tendermint and the application on startup.
  14. ## State
  15. Since Tendermint maintains three concurrent ABCI connections, it is typical
  16. for an application to maintain a distinct state for each, and for the states to
  17. be synchronized during `Commit`.
  18. ### Commit
  19. Application state should only be persisted to disk during `Commit`.
  20. Before `Commit` is called, Tendermint locks and flushes the mempool so that no new messages will
  21. be received on the mempool connection. This provides an opportunity to safely update all three
  22. states to the latest committed state at once.
  23. When `Commit` completes, it unlocks the mempool.
  24. Note that it is not possible to send transactions to Tendermint during `Commit` - if your app
  25. tries to send a `/broadcast_tx` to Tendermint during Commit, it will deadlock.
  26. ### Consensus Connection
  27. The Consensus Connection should maintain a `DeliverTxState` -
  28. the working state for block execution. It should be updated by the calls to
  29. `BeginBlock`, `DeliverTx`, and `EndBlock` during block execution and committed to
  30. disk as the "latest committed state" during `Commit`.
  31. Updates made to the DeliverTxState by each method call must be readable by each subsequent method -
  32. ie. the updates are linearizable.
  33. ### Mempool Connection
  34. The Mempool Connection should maintain a `CheckTxState`
  35. to sequentially process pending transactions in the mempool that have
  36. not yet been committed. It should be initialized to the latest committed state
  37. at the end of every `Commit`.
  38. The CheckTxState may be updated concurrently with the DeliverTxState, as
  39. messages may be sent concurrently on the Consensus and Mempool connections. However,
  40. before calling `Commit`, Tendermint will lock and flush the mempool connection,
  41. ensuring that all existing CheckTx are responded to and no new ones can
  42. begin.
  43. After `Commit`, CheckTx is run again on all transactions that remain in the
  44. node's local mempool after filtering those included in the block. To prevent the
  45. mempool from rechecking all transactions every time a block is committed, set
  46. the configuration option `mempool.recheck=false`.
  47. Finally, the mempool will unlock and new transactions can be processed through CheckTx again.
  48. Note that CheckTx doesn't have to check everything that affects transaction validity; the
  49. expensive things can be skipped. In fact, CheckTx doesn't have to check
  50. anything; it might say that any transaction is a valid transaction.
  51. Unlike DeliverTx, CheckTx is just there as
  52. a sort of weak filter to keep invalid transactions out of the blockchain. It's
  53. weak, because a Byzantine node doesn't care about CheckTx; it can propose a
  54. block full of invalid transactions if it wants.
  55. ### Info Connection
  56. The Info Connection should maintain a `QueryState` for answering queries from the user,
  57. and for initialization when Tendermint first starts up (both described further
  58. below).
  59. It should always contain the latest committed state associated with the
  60. latest committed block.
  61. QueryState should be set to the latest `DeliverTxState` at the end of every `Commit`,
  62. ie. after the full block has been processed and the state committed to disk.
  63. Otherwise it should never be modified.
  64. ## Transaction Results
  65. `ResponseCheckTx` and `ResponseDeliverTx` contain the same fields.
  66. The `Info` and `Log` fields are non-deterministic values for debugging/convenience purposes
  67. that are otherwise ignored.
  68. The `Data` field must be strictly deterministic, but can be arbitrary data.
  69. ### Gas
  70. Ethereum introduced the notion of `gas` as an abstract representation of the
  71. cost of resources used by nodes when processing transactions. Every operation in the
  72. Ethereum Virtual Machine uses some amount of gas, and gas can be accepted at a market-variable price.
  73. Users propose a maximum amount of gas for their transaction; if the tx uses less, they get
  74. the difference credited back. Tendermint adopts a similar abstraction,
  75. though uses it only optionally and weakly, allowing applications to define
  76. their own sense of the cost of execution.
  77. In Tendermint, the `ConsensusParams.Block.MaxGas` limits the amount of `gas` that can be used in a block.
  78. The default value is `-1`, meaning no limit, or that the concept of gas is
  79. meaningless.
  80. Responses contain a `GasWanted` and `GasUsed` field. The former is the maximum
  81. amount of gas the sender of a tx is willing to use, and the later is how much it actually
  82. used. Applications should enforce that `GasUsed <= GasWanted` - ie. tx execution
  83. should halt before it can use more resources than it requested.
  84. When `MaxGas > -1`, Tendermint enforces the following rules:
  85. - `GasWanted <= MaxGas` for all txs in the mempool
  86. - `(sum of GasWanted in a block) <= MaxGas` when proposing a block
  87. If `MaxGas == -1`, no rules about gas are enforced.
  88. Note that Tendermint does not currently enforce anything about Gas in the consensus, only the mempool.
  89. This means it does not guarantee that committed blocks satisfy these rules!
  90. It is the application's responsibility to return non-zero response codes when gas limits are exceeded.
  91. The `GasUsed` field is ignored completely by Tendermint. That said, applications should enforce:
  92. - `GasUsed <= GasWanted` for any given transaction
  93. - `(sum of GasUsed in a block) <= MaxGas` for every block
  94. In the future, we intend to add a `Priority` field to the responses that can be
  95. used to explicitly prioritize txs in the mempool for inclusion in a block
  96. proposal. See [#1861](https://github.com/tendermint/tendermint/issues/1861).
  97. ### CheckTx
  98. If `Code != 0`, it will be rejected from the mempool and hence
  99. not broadcasted to other peers and not included in a proposal block.
  100. `Data` contains the result of the CheckTx transaction execution, if any. It is
  101. semantically meaningless to Tendermint.
  102. `Tags` include any tags for the execution, though since the transaction has not
  103. been committed yet, they are effectively ignored by Tendermint.
  104. ### DeliverTx
  105. If DeliverTx returns `Code != 0`, the transaction will be considered invalid,
  106. though it is still included in the block.
  107. `Data` contains the result of the CheckTx transaction execution, if any. It is
  108. semantically meaningless to Tendermint.
  109. Both the `Code` and `Data` are included in a structure that is hashed into the
  110. `LastResultsHash` of the next block header.
  111. `Tags` include any tags for the execution, which Tendermint will use to index
  112. the transaction by. This allows transactions to be queried according to what
  113. events took place during their execution.
  114. See issue [#1007](https://github.com/tendermint/tendermint/issues/1007) for how
  115. the tags will be hashed into the next block header.
  116. ## Validator Updates
  117. The application may set the validator set during InitChain, and update it during
  118. EndBlock.
  119. Note that the maximum total power of the validator set is bounded by
  120. `MaxTotalVotingPower = MaxInt64 / 8`. Applications are responsible for ensuring
  121. they do not make changes to the validator set that cause it to exceed this
  122. limit.
  123. Additionally, applications must ensure that a single set of updates does not contain any duplicates -
  124. a given public key can only appear in an update once. If an update includes
  125. duplicates, the block execution will fail irrecoverably.
  126. ### InitChain
  127. ResponseInitChain can return a list of validators.
  128. If the list is empty, Tendermint will use the validators loaded in the genesis
  129. file.
  130. If the list is not empty, Tendermint will use it for the validator set.
  131. This way the application can determine the initial validator set for the
  132. blockchain.
  133. ### EndBlock
  134. Updates to the Tendermint validator set can be made by returning
  135. `ValidatorUpdate` objects in the `ResponseEndBlock`:
  136. ```
  137. message ValidatorUpdate {
  138. PubKey pub_key
  139. int64 power
  140. }
  141. message PubKey {
  142. string type
  143. bytes data
  144. }
  145. ```
  146. The `pub_key` currently supports only one type:
  147. - `type = "ed25519" and`data = <raw 32-byte public key>`
  148. The `power` is the new voting power for the validator, with the
  149. following rules:
  150. - power must be non-negative
  151. - if power is 0, the validator must already exist, and will be removed from the
  152. validator set
  153. - if power is non-0:
  154. - if the validator does not already exist, it will be added to the validator
  155. set with the given power
  156. - if the validator does already exist, its power will be adjusted to the given power
  157. - the total power of the new validator set must not exceed MaxTotalVotingPower
  158. Note the updates returned in block `H` will only take effect at block `H+2`.
  159. ## Consensus Parameters
  160. ConsensusParams enforce certain limits in the blockchain, like the maximum size
  161. of blocks, amount of gas used in a block, and the maximum acceptable age of
  162. evidence. They can be set in InitChain and updated in EndBlock.
  163. ### Block.MaxBytes
  164. The maximum size of a complete Amino encoded block.
  165. This is enforced by Tendermint consensus.
  166. This implies a maximum tx size that is this MaxBytes, less the expected size of
  167. the header, the validator set, and any included evidence in the block.
  168. Must have `0 < MaxBytes < 100 MB`.
  169. ### Block.MaxGas
  170. The maximum of the sum of `GasWanted` in a proposed block.
  171. This is *not* enforced by Tendermint consensus.
  172. It is left to the app to enforce (ie. if txs are included past the
  173. limit, they should return non-zero codes). It is used by Tendermint to limit the
  174. txs included in a proposed block.
  175. Must have `MaxGas >= -1`.
  176. If `MaxGas == -1`, no limit is enforced.
  177. ### Block.TimeIotaMs
  178. The minimum time between consecutive blocks (in milliseconds).
  179. This is enforced by Tendermint consensus.
  180. Must have `TimeIotaMs > 0` to ensure time monotonicity.
  181. ### EvidenceParams.MaxAge
  182. This is the maximum age of evidence.
  183. This is enforced by Tendermint consensus.
  184. If a block includes evidence older than this, the block will be rejected
  185. (validators won't vote for it).
  186. Must have `0 < MaxAge`.
  187. ### Updates
  188. The application may set the ConsensusParams during InitChain, and update them during
  189. EndBlock. If the ConsensusParams is empty, it will be ignored. Each field
  190. that is not empty will be applied in full. For instance, if updating the
  191. Block.MaxBytes, applications must also set the other Block fields (like
  192. Block.MaxGas), even if they are unchanged, as they will otherwise cause the
  193. value to be updated to 0.
  194. #### InitChain
  195. ResponseInitChain includes a ConsensusParams.
  196. If its nil, Tendermint will use the params loaded in the genesis
  197. file. If it's not nil, Tendermint will use it.
  198. This way the application can determine the initial consensus params for the
  199. blockchain.
  200. #### EndBlock
  201. ResponseEndBlock includes a ConsensusParams.
  202. If its nil, Tendermint will do nothing.
  203. If it's not nil, Tendermint will use it.
  204. This way the application can update the consensus params over time.
  205. Note the updates returned in block `H` will take effect right away for block
  206. `H+1`.
  207. ## Query
  208. Query is a generic method with lots of flexibility to enable diverse sets
  209. of queries on application state. Tendermint makes use of Query to filter new peers
  210. based on ID and IP, and exposes Query to the user over RPC.
  211. Note that calls to Query are not replicated across nodes, but rather query the
  212. local node's state - hence they may return stale reads. For reads that require
  213. consensus, use a transaction.
  214. The most important use of Query is to return Merkle proofs of the application state at some height
  215. that can be used for efficient application-specific lite-clients.
  216. Note Tendermint has technically no requirements from the Query
  217. message for normal operation - that is, the ABCI app developer need not implement
  218. Query functionality if they do not wish too.
  219. ### Query Proofs
  220. The Tendermint block header includes a number of hashes, each providing an
  221. anchor for some type of proof about the blockchain. The `ValidatorsHash` enables
  222. quick verification of the validator set, the `DataHash` gives quick
  223. verification of the transactions included in the block, etc.
  224. The `AppHash` is unique in that it is application specific, and allows for
  225. application-specific Merkle proofs about the state of the application.
  226. While some applications keep all relevant state in the transactions themselves
  227. (like Bitcoin and its UTXOs), others maintain a separated state that is
  228. computed deterministically *from* transactions, but is not contained directly in
  229. the transactions themselves (like Ethereum contracts and accounts).
  230. For such applications, the `AppHash` provides a much more efficient way to verify lite-client proofs.
  231. ABCI applications can take advantage of more efficient lite-client proofs for
  232. their state as follows:
  233. - return the Merkle root of the deterministic application state in
  234. `ResponseCommit.Data`.
  235. - it will be included as the `AppHash` in the next block.
  236. - return efficient Merkle proofs about that application state in `ResponseQuery.Proof`
  237. that can be verified using the `AppHash` of the corresponding block.
  238. For instance, this allows an application's lite-client to verify proofs of
  239. absence in the application state, something which is much less efficient to do using the block hash.
  240. Some applications (eg. Ethereum, Cosmos-SDK) have multiple "levels" of Merkle trees,
  241. where the leaves of one tree are the root hashes of others. To support this, and
  242. the general variability in Merkle proofs, the `ResponseQuery.Proof` has some minimal structure:
  243. ```
  244. message Proof {
  245. repeated ProofOp ops
  246. }
  247. message ProofOp {
  248. string type = 1;
  249. bytes key = 2;
  250. bytes data = 3;
  251. }
  252. ```
  253. Each `ProofOp` contains a proof for a single key in a single Merkle tree, of the specified `type`.
  254. This allows ABCI to support many different kinds of Merkle trees, encoding
  255. formats, and proofs (eg. of presence and absence) just by varying the `type`.
  256. The `data` contains the actual encoded proof, encoded according to the `type`.
  257. When verifying the full proof, the root hash for one ProofOp is the value being
  258. verified for the next ProofOp in the list. The root hash of the final ProofOp in
  259. the list should match the `AppHash` being verified against.
  260. ### Peer Filtering
  261. When Tendermint connects to a peer, it sends two queries to the ABCI application
  262. using the following paths, with no additional data:
  263. - `/p2p/filter/addr/<IP:PORT>`, where `<IP:PORT>` denote the IP address and
  264. the port of the connection
  265. - `p2p/filter/id/<ID>`, where `<ID>` is the peer node ID (ie. the
  266. pubkey.Address() for the peer's PubKey)
  267. If either of these queries return a non-zero ABCI code, Tendermint will refuse
  268. to connect to the peer.
  269. ### Paths
  270. Queries are directed at paths, and may optionally include additional data.
  271. The expectation is for there to be some number of high level paths
  272. differentiating concerns, like `/p2p`, `/store`, and `/app`. Currently,
  273. Tendermint only uses `/p2p`, for filtering peers. For more advanced use, see the
  274. implementation of
  275. [Query in the Cosmos-SDK](https://github.com/cosmos/cosmos-sdk/blob/v0.23.1/baseapp/baseapp.go#L333).
  276. ## Crash Recovery
  277. On startup, Tendermint calls the `Info` method on the Info Connection to get the latest
  278. committed state of the app. The app MUST return information consistent with the
  279. last block it succesfully completed Commit for.
  280. If the app succesfully committed block H but not H+1, then `last_block_height = H` and `last_block_app_hash = <hash returned by Commit for block H>`. If the app
  281. failed during the Commit of block H, then `last_block_height = H-1` and
  282. `last_block_app_hash = <hash returned by Commit for block H-1, which is the hash in the header of block H>`.
  283. We now distinguish three heights, and describe how Tendermint syncs itself with
  284. the app.
  285. ```
  286. storeBlockHeight = height of the last block Tendermint saw a commit for
  287. stateBlockHeight = height of the last block for which Tendermint completed all
  288. block processing and saved all ABCI results to disk
  289. appBlockHeight = height of the last block for which ABCI app succesfully
  290. completed Commit
  291. ```
  292. Note we always have `storeBlockHeight >= stateBlockHeight` and `storeBlockHeight >= appBlockHeight`
  293. Note also we never call Commit on an ABCI app twice for the same height.
  294. The procedure is as follows.
  295. First, some simple start conditions:
  296. If `appBlockHeight == 0`, then call InitChain.
  297. If `storeBlockHeight == 0`, we're done.
  298. Now, some sanity checks:
  299. If `storeBlockHeight < appBlockHeight`, error
  300. If `storeBlockHeight < stateBlockHeight`, panic
  301. If `storeBlockHeight > stateBlockHeight+1`, panic
  302. Now, the meat:
  303. If `storeBlockHeight == stateBlockHeight && appBlockHeight < storeBlockHeight`,
  304. replay all blocks in full from `appBlockHeight` to `storeBlockHeight`.
  305. This happens if we completed processing the block, but the app forgot its height.
  306. If `storeBlockHeight == stateBlockHeight && appBlockHeight == storeBlockHeight`, we're done.
  307. This happens if we crashed at an opportune spot.
  308. If `storeBlockHeight == stateBlockHeight+1`
  309. This happens if we started processing the block but didn't finish.
  310. If `appBlockHeight < stateBlockHeight`
  311. replay all blocks in full from `appBlockHeight` to `storeBlockHeight-1`,
  312. and replay the block at `storeBlockHeight` using the WAL.
  313. This happens if the app forgot the last block it committed.
  314. If `appBlockHeight == stateBlockHeight`,
  315. replay the last block (storeBlockHeight) in full.
  316. This happens if we crashed before the app finished Commit
  317. If `appBlockHeight == storeBlockHeight`
  318. update the state using the saved ABCI responses but dont run the block against the real app.
  319. This happens if we crashed after the app finished Commit but before Tendermint saved the state.