To learn high concurrency, distributed, hadoop, hive, hbase, zookeeper and other related components, CDH, spark and database of Cloudera manager, I won't list them one by one.
As for training institutions, just look around and compare them. According to my personal opinion, big data can be a little bit of an institution, so can Shang Shang Silicon Valley.