{"id":3897,"date":"2023-11-04T23:13:55","date_gmt":"2023-11-04T23:13:55","guid":{"rendered":"http:\/\/localhost:10003\/working-with-data-using-pandas\/"},"modified":"2023-11-05T05:48:28","modified_gmt":"2023-11-05T05:48:28","slug":"working-with-data-using-pandas","status":"publish","type":"post","link":"http:\/\/localhost:10003\/working-with-data-using-pandas\/","title":{"rendered":"Working with data using Pandas"},"content":{"rendered":"<p>Python has been a popular language for data analysis and manipulation over the years due to its powerful libraries. One of these libraries is <em>Pandas<\/em>, which is widely used for data analysis. Pandas provides an easy-to-use data structure and data manipulation tools. In this tutorial, we will cover the basics of working with data using Pandas.<\/p>\n<h2>Setting Up Pandas<\/h2>\n<p>Before we can start working with Pandas, we need to install it. You can install Pandas using pip, the package installer for Python:<\/p>\n<pre><code class=\"language-python\">pip install pandas\n<\/code><\/pre>\n<p>Once installed, you can import it using the following command:<\/p>\n<pre><code class=\"language-python\">import pandas as pd\n<\/code><\/pre>\n<h2>The Pandas Data Structure<\/h2>\n<p>Pandas provides two fundamental data structures:<\/p>\n<ul>\n<li>Series &#8211; a one-dimensional array-like object that can hold any data type.<\/li>\n<li>DataFrame &#8211; a two-dimensional table consisting of rows and columns.<\/li>\n<\/ul>\n<h3>Series Data Structure<\/h3>\n<p>A Series can be created by passing a list of values, an array, or a scalar value. The first column represents the index, and the second column represents the values.<\/p>\n<pre><code class=\"language-python\">import pandas as pd\nimport numpy as np\n\ndata = pd.Series([0.25, 0.5, 0.75, 1.0])\nprint(data)\n<\/code><\/pre>\n<p>Output:<\/p>\n<pre><code>0    0.25\n1    0.50\n2    0.75\n3    1.00\ndtype: float64\n<\/code><\/pre>\n<h3>DataFrame Data Structure<\/h3>\n<p>A DataFrame can be created by passing a dictionary of arrays, lists, or Series. The dictionary keys represent the column names, and the dictionary values represent the column data.<\/p>\n<pre><code class=\"language-python\">data = {'name': ['John', 'Jane', 'Alice', 'Bob'],\n        'age': [30, 25, 40, 35],\n        'gender': ['male', 'female', 'female', 'male']}\n\ndf = pd.DataFrame(data)\nprint(df)\n<\/code><\/pre>\n<p>Output:<\/p>\n<pre><code>    name  age  gender\n0   John   30    male\n1   Jane   25  female\n2  Alice   40  female\n3    Bob   35    male\n<\/code><\/pre>\n<h2>Reading and Writing Data<\/h2>\n<p>Pandas provides many functions to read and write data in different formats such as CSV, Excel, SQL, and others.<\/p>\n<h3>Reading Data<\/h3>\n<p>Pandas provides a wide range of functions to read data:<\/p>\n<ul>\n<li><code>pd.read_csv()<\/code> &#8211; reads a CSV file.<\/li>\n<li><code>pd.read_excel()<\/code> &#8211; reads an Excel file.<\/li>\n<li><code>pd.read_sql()<\/code> &#8211; reads data from a SQL database.<\/li>\n<\/ul>\n<p>For instance, to read a CSV file, you can use <code>pd.read_csv()<\/code> as follows:<\/p>\n<pre><code class=\"language-python\">data = pd.read_csv('data.csv')\n<\/code><\/pre>\n<h3>Writing Data<\/h3>\n<p>Similarly, Pandas provides functions to write data in various formats:<\/p>\n<ul>\n<li><code>df.to_csv()<\/code> &#8211; write a DataFrame to a CSV file.<\/li>\n<li><code>df.to_excel()<\/code> &#8211; write a DataFrame to an Excel file.<\/li>\n<li><code>df.to_sql()<\/code> &#8211; writes data to a SQL database.<\/li>\n<\/ul>\n<p>For example, to write a DataFrame to a CSV file, you can use <code>df.to_csv()<\/code> as follows:<\/p>\n<pre><code class=\"language-python\">df.to_csv('output.csv', index=False)\n<\/code><\/pre>\n<p>The <code>index=False<\/code> parameter will exclude the index column from the CSV file.<\/p>\n<h2>Basic Operations<\/h2>\n<p>Once we have loaded data into our DataFrame, we can perform various operations on it. Here, we will look at some of the basic operations that we can perform.<\/p>\n<h3>Viewing Data<\/h3>\n<p>Pandas provides several ways to view data:<\/p>\n<ul>\n<li><code>df.head()<\/code> &#8211; displays the first few rows of the DataFrame.<\/li>\n<li><code>df.tail()<\/code> &#8211; displays the last few rows of the DataFrame.<\/li>\n<li><code>df.index<\/code> &#8211; displays the index of the DataFrame.<\/li>\n<li><code>df.columns<\/code> &#8211; displays the column names of the DataFrame.<\/li>\n<li><code>df.shape<\/code> &#8211; displays the number of rows and columns of the DataFrame.<\/li>\n<\/ul>\n<pre><code class=\"language-python\">print(df.head())\n<\/code><\/pre>\n<p>Output:<\/p>\n<pre><code>    name  age  gender\n0   John   30    male\n1   Jane   25  female\n2  Alice   40  female\n3    Bob   35    male\n<\/code><\/pre>\n<h3>Selection and Slicing<\/h3>\n<p>We can select, filter, and slice data using several methods:<\/p>\n<ul>\n<li><code>df['column_name']<\/code> or <code>df.column_name<\/code> &#8211; select a column from the DataFrame.<\/li>\n<li><code>df.loc[row_label, col_label]<\/code> &#8211; select a subset of rows and columns using the row and column labels.<\/li>\n<li><code>df.iloc[row_num, col_num]<\/code> &#8211; select a subset of rows and columns using integer indexing.<\/li>\n<li><code>df.query()<\/code> &#8211; select rows based on a condition.<\/li>\n<li><code>df.filter()<\/code> &#8211; select columns based on a condition.<\/li>\n<\/ul>\n<pre><code class=\"language-python\">print(df['name'])\n<\/code><\/pre>\n<p>Output:<\/p>\n<pre><code>0     John\n1     Jane\n2    Alice\n3      Bob\nName: name, dtype: object\n<\/code><\/pre>\n<pre><code class=\"language-python\">print(df.loc[0:1, ['name', 'gender']])\n<\/code><\/pre>\n<p>Output:<\/p>\n<pre><code>   name  gender\n0  John    male\n1  Jane  female\n<\/code><\/pre>\n<h3>Filtering<\/h3>\n<p>We can also filter data for specific values or conditions:<\/p>\n<pre><code class=\"language-python\">print(df[df.age &gt; 30])\n<\/code><\/pre>\n<p>Output:<\/p>\n<pre><code>    name  age gender\n2  Alice   40      f\n3    Bob   35      m\n<\/code><\/pre>\n<h3>Grouping<\/h3>\n<p>We can group our data based on one or more variables and then perform aggregation functions, such as mean, sum, and count, on the grouped data:<\/p>\n<pre><code class=\"language-python\">grouped_data = df.groupby(['gender'])['age'].mean()\nprint(grouped_data)\n<\/code><\/pre>\n<p>Output:<\/p>\n<pre><code>gender\nfemale    32.5\nmale      32.5\nName: age, dtype: float64\n<\/code><\/pre>\n<h2>Conclusion<\/h2>\n<p>In this tutorial, we have covered the basics of working with data using Pandas. We learned about the Pandas data structure, reading and writing data, and performing basic operations such as selection, filtering, and grouping. With this knowledge, you can analyze and manipulate any dataset using Pandas.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Python has been a popular language for data analysis and manipulation over the years due to its powerful libraries. One of these libraries is Pandas, which is widely used for data analysis. Pandas provides an easy-to-use data structure and data manipulation tools. In this tutorial, we will cover the basics <a href=\"http:\/\/localhost:10003\/working-with-data-using-pandas\/\" class=\"btn btn-link continue-link\">Continue Reading<\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_import_markdown_pro_load_document_selector":0,"_import_markdown_pro_submit_text_textarea":"","footnotes":""},"categories":[1],"tags":[193,194,195,192,191],"yoast_head":"<!-- This site is optimized with the Yoast SEO Premium plugin v21.5 (Yoast SEO v21.5) - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Working with data using Pandas - Pantherax Blogs<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"http:\/\/localhost:10003\/working-with-data-using-pandas\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Working with data using Pandas\" \/>\n<meta property=\"og:description\" content=\"Python has been a popular language for data analysis and manipulation over the years due to its powerful libraries. One of these libraries is Pandas, which is widely used for data analysis. Pandas provides an easy-to-use data structure and data manipulation tools. In this tutorial, we will cover the basics Continue Reading\" \/>\n<meta property=\"og:url\" content=\"http:\/\/localhost:10003\/working-with-data-using-pandas\/\" \/>\n<meta property=\"og:site_name\" content=\"Pantherax Blogs\" \/>\n<meta property=\"article:published_time\" content=\"2023-11-04T23:13:55+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2023-11-05T05:48:28+00:00\" \/>\n<meta name=\"author\" content=\"Panther\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Panther\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"4 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\n\t    \"@context\": \"https:\/\/schema.org\",\n\t    \"@graph\": [\n\t        {\n\t            \"@type\": \"Article\",\n\t            \"@id\": \"http:\/\/localhost:10003\/working-with-data-using-pandas\/#article\",\n\t            \"isPartOf\": {\n\t                \"@id\": \"http:\/\/localhost:10003\/working-with-data-using-pandas\/\"\n\t            },\n\t            \"author\": {\n\t                \"name\": \"Panther\",\n\t                \"@id\": \"http:\/\/localhost:10003\/#\/schema\/person\/b63d816f4964b163e53cbbcffaa0f3d7\"\n\t            },\n\t            \"headline\": \"Working with data using Pandas\",\n\t            \"datePublished\": \"2023-11-04T23:13:55+00:00\",\n\t            \"dateModified\": \"2023-11-05T05:48:28+00:00\",\n\t            \"mainEntityOfPage\": {\n\t                \"@id\": \"http:\/\/localhost:10003\/working-with-data-using-pandas\/\"\n\t            },\n\t            \"wordCount\": 556,\n\t            \"publisher\": {\n\t                \"@id\": \"http:\/\/localhost:10003\/#organization\"\n\t            },\n\t            \"keywords\": [\n\t                \"\\\"Data analysis\\\"\",\n\t                \"\\\"Data manipulation\\\"\",\n\t                \"\\\"Data wrangling\\\"]\",\n\t                \"\\\"Pandas\\\"\",\n\t                \"[\\\"Working with data\\\"\"\n\t            ],\n\t            \"inLanguage\": \"en-US\"\n\t        },\n\t        {\n\t            \"@type\": \"WebPage\",\n\t            \"@id\": \"http:\/\/localhost:10003\/working-with-data-using-pandas\/\",\n\t            \"url\": \"http:\/\/localhost:10003\/working-with-data-using-pandas\/\",\n\t            \"name\": \"Working with data using Pandas - Pantherax Blogs\",\n\t            \"isPartOf\": {\n\t                \"@id\": \"http:\/\/localhost:10003\/#website\"\n\t            },\n\t            \"datePublished\": \"2023-11-04T23:13:55+00:00\",\n\t            \"dateModified\": \"2023-11-05T05:48:28+00:00\",\n\t            \"breadcrumb\": {\n\t                \"@id\": \"http:\/\/localhost:10003\/working-with-data-using-pandas\/#breadcrumb\"\n\t            },\n\t            \"inLanguage\": \"en-US\",\n\t            \"potentialAction\": [\n\t                {\n\t                    \"@type\": \"ReadAction\",\n\t                    \"target\": [\n\t                        \"http:\/\/localhost:10003\/working-with-data-using-pandas\/\"\n\t                    ]\n\t                }\n\t            ]\n\t        },\n\t        {\n\t            \"@type\": \"BreadcrumbList\",\n\t            \"@id\": \"http:\/\/localhost:10003\/working-with-data-using-pandas\/#breadcrumb\",\n\t            \"itemListElement\": [\n\t                {\n\t                    \"@type\": \"ListItem\",\n\t                    \"position\": 1,\n\t                    \"name\": \"Home\",\n\t                    \"item\": \"http:\/\/localhost:10003\/\"\n\t                },\n\t                {\n\t                    \"@type\": \"ListItem\",\n\t                    \"position\": 2,\n\t                    \"name\": \"Working with data using Pandas\"\n\t                }\n\t            ]\n\t        },\n\t        {\n\t            \"@type\": \"WebSite\",\n\t            \"@id\": \"http:\/\/localhost:10003\/#website\",\n\t            \"url\": \"http:\/\/localhost:10003\/\",\n\t            \"name\": \"Pantherax Blogs\",\n\t            \"description\": \"\",\n\t            \"publisher\": {\n\t                \"@id\": \"http:\/\/localhost:10003\/#organization\"\n\t            },\n\t            \"potentialAction\": [\n\t                {\n\t                    \"@type\": \"SearchAction\",\n\t                    \"target\": {\n\t                        \"@type\": \"EntryPoint\",\n\t                        \"urlTemplate\": \"http:\/\/localhost:10003\/?s={search_term_string}\"\n\t                    },\n\t                    \"query-input\": \"required name=search_term_string\"\n\t                }\n\t            ],\n\t            \"inLanguage\": \"en-US\"\n\t        },\n\t        {\n\t            \"@type\": \"Organization\",\n\t            \"@id\": \"http:\/\/localhost:10003\/#organization\",\n\t            \"name\": \"Pantherax Blogs\",\n\t            \"url\": \"http:\/\/localhost:10003\/\",\n\t            \"logo\": {\n\t                \"@type\": \"ImageObject\",\n\t                \"inLanguage\": \"en-US\",\n\t                \"@id\": \"http:\/\/localhost:10003\/#\/schema\/logo\/image\/\",\n\t                \"url\": \"http:\/\/localhost:10003\/wp-content\/uploads\/2023\/11\/cropped-9e7721cb-2d62-4f72-ab7f-7d1d8db89226.jpeg\",\n\t                \"contentUrl\": \"http:\/\/localhost:10003\/wp-content\/uploads\/2023\/11\/cropped-9e7721cb-2d62-4f72-ab7f-7d1d8db89226.jpeg\",\n\t                \"width\": 1024,\n\t                \"height\": 1024,\n\t                \"caption\": \"Pantherax Blogs\"\n\t            },\n\t            \"image\": {\n\t                \"@id\": \"http:\/\/localhost:10003\/#\/schema\/logo\/image\/\"\n\t            }\n\t        },\n\t        {\n\t            \"@type\": \"Person\",\n\t            \"@id\": \"http:\/\/localhost:10003\/#\/schema\/person\/b63d816f4964b163e53cbbcffaa0f3d7\",\n\t            \"name\": \"Panther\",\n\t            \"image\": {\n\t                \"@type\": \"ImageObject\",\n\t                \"inLanguage\": \"en-US\",\n\t                \"@id\": \"http:\/\/localhost:10003\/#\/schema\/person\/image\/\",\n\t                \"url\": \"http:\/\/2.gravatar.com\/avatar\/b8c0eda5a49f8f31ec32d0a0f9d6f838?s=96&d=mm&r=g\",\n\t                \"contentUrl\": \"http:\/\/2.gravatar.com\/avatar\/b8c0eda5a49f8f31ec32d0a0f9d6f838?s=96&d=mm&r=g\",\n\t                \"caption\": \"Panther\"\n\t            },\n\t            \"sameAs\": [\n\t                \"http:\/\/localhost:10003\"\n\t            ],\n\t            \"url\": \"http:\/\/localhost:10003\/author\/pepethefrog\/\"\n\t        }\n\t    ]\n\t}<\/script>\n<!-- \/ Yoast SEO Premium plugin. -->","yoast_head_json":{"title":"Working with data using Pandas - Pantherax Blogs","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"http:\/\/localhost:10003\/working-with-data-using-pandas\/","og_locale":"en_US","og_type":"article","og_title":"Working with data using Pandas","og_description":"Python has been a popular language for data analysis and manipulation over the years due to its powerful libraries. One of these libraries is Pandas, which is widely used for data analysis. Pandas provides an easy-to-use data structure and data manipulation tools. In this tutorial, we will cover the basics Continue Reading","og_url":"http:\/\/localhost:10003\/working-with-data-using-pandas\/","og_site_name":"Pantherax Blogs","article_published_time":"2023-11-04T23:13:55+00:00","article_modified_time":"2023-11-05T05:48:28+00:00","author":"Panther","twitter_card":"summary_large_image","twitter_misc":{"Written by":"Panther","Est. reading time":"4 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"http:\/\/localhost:10003\/working-with-data-using-pandas\/#article","isPartOf":{"@id":"http:\/\/localhost:10003\/working-with-data-using-pandas\/"},"author":{"name":"Panther","@id":"http:\/\/localhost:10003\/#\/schema\/person\/b63d816f4964b163e53cbbcffaa0f3d7"},"headline":"Working with data using Pandas","datePublished":"2023-11-04T23:13:55+00:00","dateModified":"2023-11-05T05:48:28+00:00","mainEntityOfPage":{"@id":"http:\/\/localhost:10003\/working-with-data-using-pandas\/"},"wordCount":556,"publisher":{"@id":"http:\/\/localhost:10003\/#organization"},"keywords":["\"Data analysis\"","\"Data manipulation\"","\"Data wrangling\"]","\"Pandas\"","[\"Working with data\""],"inLanguage":"en-US"},{"@type":"WebPage","@id":"http:\/\/localhost:10003\/working-with-data-using-pandas\/","url":"http:\/\/localhost:10003\/working-with-data-using-pandas\/","name":"Working with data using Pandas - Pantherax Blogs","isPartOf":{"@id":"http:\/\/localhost:10003\/#website"},"datePublished":"2023-11-04T23:13:55+00:00","dateModified":"2023-11-05T05:48:28+00:00","breadcrumb":{"@id":"http:\/\/localhost:10003\/working-with-data-using-pandas\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["http:\/\/localhost:10003\/working-with-data-using-pandas\/"]}]},{"@type":"BreadcrumbList","@id":"http:\/\/localhost:10003\/working-with-data-using-pandas\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"http:\/\/localhost:10003\/"},{"@type":"ListItem","position":2,"name":"Working with data using Pandas"}]},{"@type":"WebSite","@id":"http:\/\/localhost:10003\/#website","url":"http:\/\/localhost:10003\/","name":"Pantherax Blogs","description":"","publisher":{"@id":"http:\/\/localhost:10003\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"http:\/\/localhost:10003\/?s={search_term_string}"},"query-input":"required name=search_term_string"}],"inLanguage":"en-US"},{"@type":"Organization","@id":"http:\/\/localhost:10003\/#organization","name":"Pantherax Blogs","url":"http:\/\/localhost:10003\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"http:\/\/localhost:10003\/#\/schema\/logo\/image\/","url":"http:\/\/localhost:10003\/wp-content\/uploads\/2023\/11\/cropped-9e7721cb-2d62-4f72-ab7f-7d1d8db89226.jpeg","contentUrl":"http:\/\/localhost:10003\/wp-content\/uploads\/2023\/11\/cropped-9e7721cb-2d62-4f72-ab7f-7d1d8db89226.jpeg","width":1024,"height":1024,"caption":"Pantherax Blogs"},"image":{"@id":"http:\/\/localhost:10003\/#\/schema\/logo\/image\/"}},{"@type":"Person","@id":"http:\/\/localhost:10003\/#\/schema\/person\/b63d816f4964b163e53cbbcffaa0f3d7","name":"Panther","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"http:\/\/localhost:10003\/#\/schema\/person\/image\/","url":"http:\/\/2.gravatar.com\/avatar\/b8c0eda5a49f8f31ec32d0a0f9d6f838?s=96&d=mm&r=g","contentUrl":"http:\/\/2.gravatar.com\/avatar\/b8c0eda5a49f8f31ec32d0a0f9d6f838?s=96&d=mm&r=g","caption":"Panther"},"sameAs":["http:\/\/localhost:10003"],"url":"http:\/\/localhost:10003\/author\/pepethefrog\/"}]}},"jetpack_sharing_enabled":true,"jetpack_featured_media_url":"","_links":{"self":[{"href":"http:\/\/localhost:10003\/wp-json\/wp\/v2\/posts\/3897"}],"collection":[{"href":"http:\/\/localhost:10003\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"http:\/\/localhost:10003\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"http:\/\/localhost:10003\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"http:\/\/localhost:10003\/wp-json\/wp\/v2\/comments?post=3897"}],"version-history":[{"count":1,"href":"http:\/\/localhost:10003\/wp-json\/wp\/v2\/posts\/3897\/revisions"}],"predecessor-version":[{"id":4633,"href":"http:\/\/localhost:10003\/wp-json\/wp\/v2\/posts\/3897\/revisions\/4633"}],"wp:attachment":[{"href":"http:\/\/localhost:10003\/wp-json\/wp\/v2\/media?parent=3897"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"http:\/\/localhost:10003\/wp-json\/wp\/v2\/categories?post=3897"},{"taxonomy":"post_tag","embeddable":true,"href":"http:\/\/localhost:10003\/wp-json\/wp\/v2\/tags?post=3897"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}