INTRODUCTION TO PANDAS

Khan Asif Salim
4 min read · Oct 8, 2021

FEATURES OF PANDAS:-

(1) It Is A High-Performance Data Analysis Tool.

(2) Works With Large Data Sets.

(3) Supports Loading Files In Different Formats.

(4) More Flexible.

(5) Represents Data In A Tabular Way (Rows And Columns).

(6) Handles Missing Data.

USES OF PANDAS:-

(1) Indexing, Slicing, And Subsetting Large Data Sets.

(2) Merge And Join Two Different Datasets Easily.

(3) Reshape Data Sets.
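
The three uses above can be sketched in a few lines. This is only a minimal illustration: the `sales` and `regions` tables, their column names, and their values are hypothetical sample data invented for this example.

```python
import pandas as pd

# Hypothetical sample data, invented for illustration only.
sales = pd.DataFrame({
    "city": ["Pune", "Delhi", "Pune", "Delhi"],
    "year": [2020, 2020, 2021, 2021],
    "units": [10, 20, 15, 25],
})
regions = pd.DataFrame({
    "city": ["Pune", "Delhi"],
    "region": ["West", "North"],
})

# (1) Subsetting: keep only the rows where units exceed 12.
big_orders = sales[sales["units"] > 12]

# (2) Merging: attach each city's region using the shared "city" column.
merged = sales.merge(regions, on="city", how="left")

# (3) Reshaping: one row per city, one column per year.
wide = sales.pivot(index="city", columns="year", values="units")

print(big_orders)
print(merged)
print(wide)
```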

DATA STRUCTURES IN PANDAS:-

(1) SERIES → One-Dimensional Labeled Homogeneous Array, Size-Immutable.

SYNTAX → s = pd.Series(data, index)

(2) DATAFRAME → Two-Dimensional Labeled Size-Mutable Tabular Structure With Potentially Heterogeneously Typed Columns.

SYNTAX → d = pd.DataFrame(data)

(3) PANEL → Multi-Dimensional. (Removed From Pandas As Of Version 0.25; The Syntax Below Works Only In Older Versions.)

SYNTAX → p = pd.Panel(data)

NOTE:- DataFrame Is More Efficient.
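
As a rough sketch of the Series and DataFrame syntax above, the snippet below builds one of each. The labels, names, and values are hypothetical and chosen only for illustration.

```python
import pandas as pd

# A Series: one-dimensional, every value shares one dtype, each value has a label.
s = pd.Series([10, 20, 30], index=["a", "b", "c"])

# A DataFrame: two-dimensional, and each column may hold a different type.
d = pd.DataFrame({"name": ["Asha", "Ravi"], "age": [25, 32]})

# Size-mutable: a new column can be added to an existing DataFrame.
d["city"] = ["Pune", "Delhi"]

print(s["b"])   # look up a Series value by its label
print(d)
```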

READING TABULAR DATA FILES IN PANDAS:-

READ TSV FILE:-
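
A tab-separated file can be read with `pd.read_csv` by passing `sep="\t"`. The sketch below reads from an in-memory string so that it runs without any file on disk; the sample rows are hypothetical, and in practice a file path would be passed instead of the `StringIO` object.

```python
import io
import pandas as pd

# Hypothetical TSV content; a path such as "data.tsv" would normally go here.
tsv_text = "name\tage\nAsha\t25\nRavi\t32\n"

# sep="\t" tells pandas that the columns are separated by tabs.
df_tsv = pd.read_csv(io.StringIO(tsv_text), sep="\t")

print(df_tsv)
```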

READ CSV FILE:-
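
For comma-separated files, `pd.read_csv` needs no separator argument because the comma is its default. This is a minimal sketch using hypothetical in-memory data in place of a real file path.

```python
import io
import pandas as pd

# Hypothetical CSV content; a path such as "data.csv" would normally go here.
csv_text = "name,age\nAsha,25\nRavi,32\n"

# The comma is the default separator, so no sep argument is required.
df_csv = pd.read_csv(io.StringIO(csv_text))

print(df_csv.head())   # head() shows the first rows of the table
```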

READ EXCEL FILE:-
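
Excel workbooks are read with `pd.read_excel`. The helper below is only a sketch: the function name `read_first_sheet` and the file name `students.xlsx` are hypothetical, and reading `.xlsx` files assumes an optional engine such as `openpyxl` is installed alongside pandas.

```python
import pandas as pd

def read_first_sheet(source):
    """Load the first worksheet of an Excel workbook into a DataFrame."""
    # sheet_name=0 selects the first sheet; a sheet's name also works here.
    return pd.read_excel(source, sheet_name=0)

# Example call, assuming a hypothetical file named "students.xlsx" exists:
# df_excel = read_first_sheet("students.xlsx")
```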

READ PSV FILE:-
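
A pipe-separated (PSV) file is read the same way as a CSV file, with `sep="|"` as the only change. The sample rows below are hypothetical and held in memory so the sketch runs on its own.

```python
import io
import pandas as pd

# Hypothetical PSV content; a path such as "data.psv" would normally go here.
psv_text = "name|age\nAsha|25\nRavi|32\n"

# sep="|" tells pandas that the columns are separated by pipe characters.
df_psv = pd.read_csv(io.StringIO(psv_text), sep="|")

print(df_psv)
```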

MANIPULATING A DATAFRAME IN PANDAS (ADD COLUMNS, DROP COLUMNS):-
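
One possible way to add and drop columns is sketched below. The column names (`price`, `qty`, `total`, `discounted`) and the 10% discount are hypothetical, picked only to make the example concrete.

```python
import pandas as pd

# Hypothetical order data, invented for illustration.
df = pd.DataFrame({"price": [100, 250], "qty": [2, 1]})

# Add a column by assigning to a new column name.
df["total"] = df["price"] * df["qty"]

# assign() also adds a column, returning a new DataFrame instead of editing in place.
df = df.assign(discounted=df["total"] * 0.9)

# Drop a column by name; drop() returns a new DataFrame as well.
df = df.drop(columns=["qty"])

print(df)
```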

UNDERSTANDING LOC AND ILOC:-
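
In short, `loc` selects by label and `iloc` selects by integer position. The sketch below uses a small hypothetical table to show both, including the difference in how they slice: a `loc` slice includes its end label, while an `iloc` slice excludes its end position.

```python
import pandas as pd

# Hypothetical data with string row labels, invented for illustration.
df = pd.DataFrame({"age": [25, 32, 41]}, index=["asha", "ravi", "meena"])

# loc: select by row label and column label.
by_label = df.loc["ravi", "age"]

# iloc: select by integer position (row 1, column 0), which is the same cell.
by_position = df.iloc[1, 0]

# A loc slice includes the end label, so this returns two rows.
label_slice = df.loc["asha":"ravi"]

# An iloc slice excludes the end position, so this returns one row.
position_slice = df.iloc[0:1]

print(by_label, by_position)
print(label_slice)
print(position_slice)
```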

HANDLING MISSING DATA (dropna & fillna) IN PANDAS:-
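
A minimal sketch of the two methods named in the heading: `dropna` removes rows that contain missing values, and `fillna` replaces missing values with a chosen one. The table and the fill value of 0 are hypothetical choices for this example.

```python
import numpy as np
import pandas as pd

# Hypothetical scores with one missing value (NaN), invented for illustration.
df = pd.DataFrame({
    "name": ["Asha", "Ravi", "Meena"],
    "score": [90.0, np.nan, 75.0],
})

# dropna() returns a copy without the rows that contain missing values.
dropped = df.dropna()

# fillna() returns a copy with missing values replaced; here only in "score".
filled = df.fillna({"score": 0})

print(dropped)
print(filled)
```

Both calls leave the original `df` unchanged, so the result has to be assigned to a name to be kept.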

REMOVE DUPLICATES FROM DATAFRAME IN PANDAS:-
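
Duplicate rows can be removed with `drop_duplicates`. The sketch below uses a hypothetical table and shows two variants: comparing whole rows, and comparing only one column while keeping the last occurrence.

```python
import pandas as pd

# Hypothetical data in which the first and third rows are identical.
df = pd.DataFrame({
    "name": ["Asha", "Ravi", "Asha"],
    "city": ["Pune", "Delhi", "Pune"],
})

# Compare entire rows; by default the first occurrence is kept.
unique_rows = df.drop_duplicates()

# Compare only the "name" column and keep the last occurrence instead.
unique_names = df.drop_duplicates(subset="name", keep="last")

print(unique_rows)
print(unique_names)
```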
