Featured Post

Python: Built-in Functions vs. For & If Loops – 5 Programs Explained

Image
Python’s built-in functions make coding fast and efficient. But understanding how they work under the hood is crucial to mastering Python. This post shows five Python tasks, each implemented in two ways: Using built-in functions Using for loops and if statements ✅ 1. Sum of a List ✅ Using Built-in Function: numbers = [ 10 , 20 , 30 , 40 ] total = sum (numbers) print ( "Sum:" , total) 🔁 Using For Loop: numbers = [ 10 , 20 , 30 , 40 ] total = 0 for num in numbers: total += num print ( "Sum:" , total) ✅ 2. Find Maximum Value ✅ Using Built-in Function: values = [ 3 , 18 , 7 , 24 , 11 ] maximum = max (values) print ( "Max:" , maximum) 🔁 Using For and If: values = [ 3 , 18 , 7 , 24 , 11 ] maximum = values[ 0 ] for val in values: if val > maximum: maximum = val print ( "Max:" , maximum) ✅ 3. Count Vowels in a String ✅ Using Built-ins: text = "hello world" vowel_count = sum ( 1 for ch in text if ch i...

Here's to Know Data lake Vs Database

In a data lake, data stored internally in a repository. You can call it a blob. The data in the lake a no-format data, but you need a schema for the database. 



Data lake Repository

Database

  • In the database, the Schema definition you need before you store data on it.
  • It should follow Codd's rules.
  • Here data is completely formatted.
  • The data stores here in Tables, so you need SQL language to read the records.
  • Poor performance in terms of scalability.



Data lake

  • It doesn't have any format - it's just a dump.
  • You can send this dump to the Hadoop repository for data analysis.
  • This repository can be incremental. You can build a database.
  • The data lake is a dump of data with no format. It needs a pre-format before it sends for analytics.
  • Data security and encryption: You need these before you send data to Hadoop.
  • In real-time, you need to pre-process data.
  • This data you need to send to the data warehouse to get insights.

Comments

Popular posts from this blog

SQL Query: 3 Methods for Calculating Cumulative SUM

5 SQL Queries That Popularly Used in Data Analysis

Big Data: Top Cloud Computing Interview Questions (1 of 4)