Skip to content
Youngwoo Kim edited this page May 23, 2022 · 168 revisions

Notes


Workflow...

Mongoose storage performance testing tools, https://github.com/emc-mongoose

Linux LVM:

https://github.com/ExpediaGroup/stream-registry

FLOW:

https://github.com/tiangolo/fastapi

-- Ubuntu 18.04, Hash sum mismatch
sudo rm -rf /var/lib/apt/lists/*
sudo apt-get update -o Acquire::CompressionTypes::Order::=gz
sudo apt-get update && sudo apt-get upgrade

Procella: Unifying serving and analytical data at YouTube, https://ai.google/research/pubs/pub48388/

https://www.slideshare.net/AmazonWebServices/big-data-analytics-architectural-patterns-and-best-practices-ant201r1-aws-reinvent-2018

https://github.com/Integerous/goQuality-dev-contents

InnerSource - http://paypal.github.io/InnerSourceCommons/assets/files/AdoptingInnerSource.pdf

Js Chart - https://blog.bitsrc.io/11-javascript-charts-and-data-visualization-libraries-for-2018-f01a283a5727

Metadata and Data lineage

https://engineering.linkedin.com/blog/2019/data-hub

https://medium.com/airbnb-engineering/democratizing-data-at-airbnb-852d76c51770

https://eng.uber.com/databook/

https://medium.com/wbaa/facilitating-data-discovery-with-apache-atlas-and-amundsen-631baa287c8b


https://medium.com/@garyogasawara/hadoop-performance-benchmark-results-comparing-on-premise-s3-vs-hdfs-cf7a9ea3baa3

http://info.bluedata.com/rs/693-TGY-247/images/IDC-Perspective-Decoupling-Compute-Storage.pdf

https://github.com/rstacruz/cheatsheets

Apache Airflow:

zeppelin, centos7, 971748464

pypi mirror, https://stackoverflow.com/questions/50348248/creating-a-full-replica-offline-copy-of-the-public-pypi-repository

Readings: