You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
 
 
 
 
 
 
Anton Kaliaev d07e164796 Merge pull request #13 from tendermint/bugfix/6-tm-monitor-does-not-reconnect-to-a-node 8 years ago
..
eventmeter [tm-monitor] use BaseService.Reset method 8 years ago
mock [tm-monitor] rewrite eventmeter to use go-kit/log 8 years ago
monitor [tm-monitor] now EventMeter can be restarted multiple times (Refs #6) 8 years ago
Dockerfile link 2 docker containers 8 years ago
Dockerfile.dev [tm-monitor] rename netmon to tools 8 years ago
LICENSE tm-monitor 0.1.0 8 years ago
Makefile [tm-monitor] fix build-docker 8 years ago
README.md [tm-monitor] proposed solution for idea #1 8 years ago
glide.lock [tm-monitor] extract lib to monitor/ dir 8 years ago
glide.yaml [tm-monitor] rewrite eventmeter to use go-kit/log 8 years ago
main.go [tm-monitor] set current version to 0.2.1 8 years ago
rpc.go [tm-monitor] preserve nodes ordering (Fixes #11) 8 years ago
ton.go [tm-monitor] extract lib to monitor/ dir 8 years ago

README.md

Tendermint monitor (tm-monitor)

Tendermint monitor watches over one or more Tendermint core applications (nodes), collecting and providing various statistics to the user.

QuickStart using Docker

docker run -it --rm -v "/tmp:/tendermint" tendermint/tendermint init
docker run -it --rm -v "/tmp:/tendermint" -p "46657:46657" --name=tm tendermint/tendermint

docker run -it --rm --link=tm tendermint/monitor tm:46657

QuickStart using binaries

Linux:

curl -L https://s3-us-west-2.amazonaws.com/tendermint/0.8.0/tendermint_linux_amd64.zip && sudo unzip -d /usr/local/bin tendermint_linux_amd64.zip && sudo chmod +x tendermint
tendermint init
tendermint node --app_proxy=dummy

tm-monitor localhost:46657

Max OS:

curl -L https://s3-us-west-2.amazonaws.com/tendermint/0.8.0/tendermint_darwin_amd64.zip && sudo unzip -d /usr/local/bin tendermint_darwin_amd64.zip && sudo chmod +x tendermint
tendermint init
tendermint node --app_proxy=dummy

tm-monitor localhost:46657

Usage

tm-monitor [-v] [-no-ton] [-listen-addr="tcp://0.0.0.0:46670"] [endpoints]

Examples:
        # monitor single instance
        tm-monitor localhost:46657

        # monitor a few instances by providing comma-separated list of RPC endpoints
        tm-monitor host1:46657,host2:46657
Flags:
  -listen-addr string
        HTTP and Websocket server listen address (default "tcp://0.0.0.0:46670")
  -no-ton
        Do not show ton (table of nodes)
  -v    verbose logging

asciicast

RPC UI

Run tm-monitor and visit http://localhost:46670. You should see the list of the available RPC endpoints:

http://localhost:46670/status
http://localhost:46670/status/network
http://localhost:46670/monitor?endpoint=_
http://localhost:46670/status/node?name=_
http://localhost:46670/unmonitor?endpoint=_

The API is available as GET requests with URI encoded parameters, or as JSONRPC POST requests. The JSONRPC methods are also exposed over websocket.

Ideas

  • currently we get IPs and dial, but should reverse so the nodes dial the netmon, both for node privacy and easier reconfig (validators changing ip/port). It would be good to have both. For testnets with others we def need them to dial the monitor. But I want to be able to run the monitor from my laptop without openning ports. If we don't want to open all the ports, maybe something like this would be a good fit for us: tm-monitor agent running on each node, collecting all the metrics. Each tm-monitor agent monitors local TM node and sends stats to a single master tm-monitor master. That way we'll only need to open a single port for UI on the node with tm-monitor master. And I believe it could be done with a single package with a few subcommands.
    # agent collecting metrics from localhost (default)
    tm-monitor agent --master="192.168.1.17:8888"
    
    # agent collecting metrics from another TM node (useful for testing, development)
    tm-monitor agent --master="192.168.1.17:8888" --node="192.168.1.18:46657"
    
    # master accepting stats from agents
    tm-monitor master [--ton] OR [--ui] (`--ui` mode by default)
    
    # display table of nodes in the terminal (useful for testing, development, playing with TM)
    # --nodes="localhost:46657" by default
    tm-monitor
    
    # display table of nodes in the terminal (useful for testing, development, playing with TM)
    tm-monitor --nodes="192.168.1.18:46657,192.168.1.19:46657"
    
  • uptime over last day, month, year. There are different meanings for uptime. One is to constantly ping the nodes and make sure they respond to eg. /status. A more fine-grained one is to check for votes in the block commits.
  • show network size + auto discovery. You can get a list of connected peers at /net_info. But no single one will be connected to the whole network, so need to tease out all the unique peers from calling /net_info on all of them. Unless you have some prior information about how many peers in the net ... More: we could add -auto-discovery option and try to connect to every node.
  • input plugin for https://github.com/influxdata/telegraf, so the user is able to get the metrics and send them whenever he wants to (grafana, prometheus, etc.).

Feel free to vote on the ideas or add your own by saying hello on Slack or by opening an issue.