The Data Engineering Vault
Written by Nikos Vaggalis   
Friday, 11 October 2024

A curated network of knowledge designed to facilitate exploration, discovery, and deep learning in the field of data engineering.

Or in other words an encyclopedia on everything about data engineering. Attempts like this are interesting because despite the advent of the LLMs which can answer just about anything, we witness a resurgence in dictionaries/encyclopedias of tech terminology.

Recently we took a look at "Dev Encyclopedia" , an open-source, easy-to-use online resource that helps make sense of complicated tech terms. It's mission is simple:

How many times have you attended a meeting or watched a presentation or read a manual which were riddled with such jargon unknown to you? Wouldn't an accessible and quick lookup guide help in such situations?

Well wish granted. Here is the "Dev Encyclopedia" made by a single Python backend developer who thought along those lines and set out to assist those bewildered by providing clear, concise explanations for all those tricky tech terms

The database of terms of Dev Encylopedia is exhaustive, encompassing many SE disciplines. It ranges from ACID to APIs and Antivirus, from Big O to Deep Learning and LLMs, from MVC and up to Zombie processes. On the contrary,The Data Engineering Vault is not generalized, but specialized on the narrow field of data engineering.

The knowledge base is organized in seven categories:

  • Data Engineering Concepts
  • Data Engineering Tools and Technologies
  • Data Engineering Practices
  • Modern Data Engineering
  • Data Engineering Management and Analysis
  • Data Engineering Design and Development
  • Additional Data Engineering Resources

As such you'll find information ranging from the Classical Architecture of Data Warehouse, to the Relational Model, SQL, Python and Apache Arrow, from ELT and ETL, to OLAP Cubes and Business Intelligence.It's all there.

Navigating from the categories to the main material, say picking "Classical Architecture of Data Warehouse", we are presented with a html page resembling a markdown document (Digital Garden) that contains clear and structured information, diagrams included too:

secondbraindiagram

 

We find that the Classical Architecture of Data Warehouse is split into the Layers of Staging, Cleansing, Core areas, the Data Mart and the Metadata. The document then delves into them.

The point is that the encyclopedia is not comprised of stiff explanations of the terminology but delves into the topic going through the advantages and disadvantages too. You can expect the same level of detail for all the material, rendering the Vault
an indispensable guide in your data engineering journey.

secondbrainlogo

More Information

Data Engineering Vault: A Second Brain Knowledge Network 

Related Articles

Dev Encyclopedia Shares The Knowledge

 

To be informed about new articles on I Programmer, sign up for our weekly newsletter, subscribe to the RSS feed and follow us on Twitter, Facebook or Linkedin.

Banner


Petition Microsoft To Defer Windows 10 End of Support
03/10/2025

With 10 days to go until Microsoft ends free support for Windows 10, Windows 7 has seen an uptick in its market share, while Windows 11 has seen a slight downturn. A new petition is asking Microsoft t [ ... ]



Anthropic Says Claude Sonnet 4.5 Is World's Best Coding Model
06/10/2025

Anthropic has released Claude Sonnet 4.5, describing it as the best coding model in the world. Anthropic says this is the strongest model for building complex agents, the best model at using computers [ ... ]


More News

pico book

 

Comments




or email your comment to: comments@i-programmer.info