Posts

Showing posts with the label terminology

Featured Post

14 Top Data Pipeline Key Terms Explained

Here are some key terms commonly used in data pipelines:

1. Data Sources
Definition: Points where data originates (e.g., databases, APIs, files, IoT devices).
Examples: Relational databases (PostgreSQL, MySQL), APIs, cloud storage (S3), streaming data (Kafka), and on-premise systems.

2. Data Ingestion
Definition: The process of importing or collecting raw data from various sources into a system for processing or storage.
Methods: Batch ingestion, real-time/streaming ingestion.

3. Data Transformation
Definition: Modifying, cleaning, or enriching data to make it usable for analysis or storage.
Examples: Data cleaning (removing duplicates, fixing missing values). Data enrichment (joining with other data sources). ETL (Extract, Transform, Load). ELT (Extract, Load, Transform).

4. Data Storage
Definition: Locations where data is stored after ingestion and transformation.
Types:
Data Lakes: Store raw, unstructured, or semi-structured data (e.g., S3, Azure Data Lake).
Data Warehous...
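To make the terms above concrete, here is a minimal sketch of a batch pipeline in Python. It is an illustration only: the sample CSV, the function names (ingest, transform, load), and the in-memory list standing in for storage are all assumptions made for this example and do not come from the original post.

```python
import csv
import io

# Hypothetical data source: a small CSV file with a duplicate row
# and a missing value, held in a string for the sake of the example.
RAW_CSV = """id,name,amount
1,alice,10.5
2,bob,
2,bob,
3,carol,7.25
"""


def ingest(text):
    """Batch ingestion: read every row from the source into a list of dicts."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(rows):
    """Data cleaning: drop rows with a repeated id and fill missing amounts with 0.0."""
    seen_ids = set()
    cleaned = []
    for row in rows:
        if row["id"] in seen_ids:
            continue  # duplicate record, skip it
        seen_ids.add(row["id"])
        amount = float(row["amount"]) if row["amount"] else 0.0
        cleaned.append({"id": int(row["id"]), "name": row["name"], "amount": amount})
    return cleaned


def load(rows, store):
    """Storage: append the cleaned rows to the target (a plain list here)."""
    store.extend(rows)
    return store


# Extract, Transform, Load, in that order (ETL).
warehouse = load(transform(ingest(RAW_CSV)), [])
```

Swapping the last two steps, so that raw rows are loaded first and cleaned inside the store, would turn the same sketch into ELT.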

Here is an Audio Post Explained About Blockchain

According to Investopedia, blockchains, which use what's known as distributed ledger technology (DLT), were originally developed as the accounting method for the virtual currency Bitcoin and are now appearing in a variety of commercial applications.

Distributed Ledger

1. What Is the Current Trend
Currently, server owners can edit any transaction; they have full control to change the transaction details. The current trend is either centralized or decentralized.

2. Distributed Trend
No one can edit the transaction details, and the ledger is transparent to all stakeholders.
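One way to see why entries in a distributed ledger are hard to edit unnoticed is a hash-linked list of records, sketched below in Python. This is a simplified illustration, not Bitcoin's actual block format: the function names (block_hash, build_chain, is_valid) and the sample transactions are hypothetical, and the sketch leaves out networking and consensus entirely.

```python
import hashlib
import json

GENESIS_PREV = "0" * 64  # placeholder "previous hash" for the first block


def block_hash(index, transaction, prev_hash):
    """Hash a block's contents together with the previous block's hash."""
    payload = json.dumps(
        {"index": index, "tx": transaction, "prev": prev_hash}, sort_keys=True
    )
    return hashlib.sha256(payload.encode("utf-8")).hexdigest()


def build_chain(transactions):
    """Link each transaction to the one before it through its hash."""
    chain = []
    prev = GENESIS_PREV
    for index, tx in enumerate(transactions):
        current = block_hash(index, tx, prev)
        chain.append({"index": index, "tx": tx, "prev": prev, "hash": current})
        prev = current
    return chain


def is_valid(chain):
    """Recompute every hash; any edited transaction breaks the chain."""
    prev = GENESIS_PREV
    for block in chain:
        if block["prev"] != prev:
            return False
        if block["hash"] != block_hash(block["index"], block["tx"], block["prev"]):
            return False
        prev = block["hash"]
    return True


ledger = build_chain(["A pays B 5", "B pays C 2"])
```

Here is_valid(ledger) holds for the untouched ledger. If someone later changes ledger[0]["tx"], the stored hash no longer matches the recomputed one, so any stakeholder holding a copy can detect the edit.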