avatarJ3

Free AI web copilot to create summaries, insights and extended knowledge, download it at here

3492

Abstract

3</span> <span class="hljs-type">Name</span>: <span class="hljs-type">FB</span>, d<span class="hljs-keyword">type</span>: int64</pre></div><div id="5120"><pre>df.<span class="hljs-built_in">groupby</span>(<span class="hljs-string">'Company'</span>).<span class="hljs-built_in">count</span>() </pre></div><figure id="004b"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/1*dw1pCK6vl06ozy1zqCgD4Q.png"><figcaption></figcaption></figure><div id="a03c"><pre>df.<span class="hljs-built_in">groupby</span>(<span class="hljs-string">'Company'</span>).<span class="hljs-built_in">max</span>()</pre></div><figure id="f79a"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/1*8ONPKTh8ulxVYXjRB-gWIA.png"><figcaption></figcaption></figure><p id="dacf">We probably use these aggregate functions with the numeric number (here is inverse of alphabet order:)</p><div id="feab"><pre>df.<span class="hljs-built_in">groupby</span>(<span class="hljs-string">'Company'</span>).<span class="hljs-built_in">min</span>()</pre></div><figure id="142a"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/1*kZJyZ5USJ0tekmkVMGInzA.png"><figcaption></figcaption></figure><p id="624e">That will give you a bunch of information!</p><div id="d1e5"><pre>df<span class="hljs-selector-class">.groupby</span>(<span class="hljs-string">'Company'</span>)<span class="hljs-selector-class">.describe</span>()</pre></div><figure id="c884"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/1*4shN7dwyDsvlG2Y-gSymVw.png"><figcaption></figcaption></figure><p id="343c">df.groupby(‘Company’).describe().transpose()[‘FB’]</p><figure id="2548"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/1*PnTnesUVPQB3KKxU6ic-BA.png"><figcaption></figcaption></figure><p id="a3c2">Colab File <a href="https://colab.research.google.com/drive/1lE3yhSztBE62FbkazmU7e1Pc1VGZuMsF?usp=sharing">link</a>:)</p><h1 id="d21c">Credits & References:</h1><p id="0285">Jose Portilla — <a href="https://www.udemy.com/course/python-for-data-science-and-machine-learning-bootcamp/">Python for Data Science and Machine Learning Bootcamp </a>— Learn how to use NumPy, Pandas, Seaborn , Matplotlib , Plotly , Scikit-Learn , Machine Learning, Tensorflow , and more!</p><h1 id="1f47">Posts Related:</h1><p id="c9be">00Episode#<b>PySeries </b>— Python — <a href="https://medium.com/@J.3/python-jupiter-notebook-quick-start-with-vscode-916c43c10d9a">Jupiter Notebook Quick Start with VSCode — How to Set your Win10 Environment to use Jupiter Notebook</a></p><p id="0c7d">01Episode#<b>PySeries </b>— Python — <a href="https://readmedium.com/python-for-engenniging-exercises-977fbe4d6d02">Python 4 Engineers — Exercises! An overview of the Opportunities Offered by Python in Engineering!</a></p><p id="5e70">02Episode#<b>PySeries </b>— Python — <a href="https://readmedium.com/geogebra-plus-linear-programming-a51661c99590">Geogebra Plus Linear Programming- We’ll Create a Geogebra program to help us with our linear programming</a></p><p id="a994">03Episode#<b>PySeries</b> — Python — Python 4 Engineers — More Exercises! — Another Round to Make Sure that Python is Really Amazing!</p><p id="3d1e">04Episode#<b>PySeries</b> — Python — <a href="https://readmedium.com/linear-regressions-the-basics-1a633f351ec2">Linear Regressions — The Basics — How to Understand Linear Regression Once and For All!</a></p><p id="b935">05Episode#<b>PySeries</b> — Python — <a href="https://readmedium.com/numpy-init-p

Options

ython-review-f5362abbaaf9">NumPy Init & Python Review — A Crash Python Review & Initialization at NumPy lib.</a></p><p id="a9a7">06Episode#<b>PySeries</b> — Python — <a href="https://readmedium.com/numpy-jupyter-notebook-1182f78ab4e1">NumPy Arrays & Jupyter Notebook — Arithmetic Operations, Indexing & Slicing, and Conditional Selection w/ np arrays</a>.</p><p id="cb6a">07Episode#<b>PySeries</b> — Python — <a href="https://readmedium.com/pandas-intro-series-970e206e2ad5">Pandas — Intro & Series — What it is? How to use it?</a></p><p id="8209">08Episode#<b>PySeries</b> — Python — <a href="https://readmedium.com/pandas-dataframes-7ba872dcbc30">Pandas DataFrames — The primary Pandas data structure! It is a dict-like container for Series objects</a></p><p id="b289">09Episode#<b>PySeries</b> — Python — <a href="https://readmedium.com/python-4-engineers-even-more-exercises-d0141e0b06d">Python 4 Engineers — Even More Exercises! — More Practicing Coding Questions in Python!</a></p><p id="0646">10Episode#<b>PySeries</b> — Python — <a href="https://readmedium.com/pandas-hierarchical-index-cross-section-30783023a274">Pandas — Hierarchical Index & Cross-section — Open your Colab notebook and here are the follow-up exercises!</a></p><p id="dea6">11Episode#<b>PySeries</b> — Python —<a href="https://readmedium.com/pandas-missing-data-5142f3eda2b"> Pandas — Missing Data — Let’s Continue the Python Exercises — Filling & Dropping Missing Data</a></p><p id="0943">12Episode#<b>PySeries</b> — Python — Pandas — Group By — Grouping large amounts of data and compute operations on these groups (this one:)</p><p id="9ff4">13Episode#<b>PySeries</b> — Python — <a href="https://readmedium.com/pandas-merging-joining-concatenations-a35bbe1a9dd5">Pandas — Merging, Joining & Concatenations — Facilities For Easily Combining Together Series or DataFrame</a></p><p id="20c9">14Episode#<b>PySeries</b> — Python — <a href="https://readmedium.com/pandas-operations-4b8f7a4b4139">Pandas — Pandas Dataframe Examples: Column Operations</a></p><p id="059e">15Episode#<b>PySeries</b> — Python — <b>Python 4 Engineers </b>— Keeping It In The Short-Term Memory — <a href="https://readmedium.com/python-4-engineers-keeping-it-in-the-short-term-memory-4f9458016171"><b>Test Yourself!</b> Coding in Python, Again!</a></p><p id="6ef1">16Episode#<b>PySeries</b> — NumPy — <a href="https://readmedium.com/numpy-review-again-f94f1c1c77e8">NumPy Review, Again;)<b> </b></a>— Python Review Free Exercises</p><p id="2159">17Episode#<b>PySeries</b><a href="https://readmedium.com/generators-in-python-8d3de173743e">Generators in Python<b></b></a><b><a href="https://readmedium.com/numpy-review-again-f94f1c1c77e8"><b> </b></a>— Python Review Free Hints</b></p><p id="9b56">18Episode#<b>PySeries</b> — P<a href="https://readmedium.com/panda-review-again-baf0687b35de">andas Review…Again;)</a> — Python Review Free Exercise</p><p id="3aa1">19Episode#<b>PySeries</b><a href="https://readmedium.com/matlibplot-seaborn-python-libs-459f6666f35f">MatlibPlot & Seaborn Python Libs </a>— Reviewing theses Plotting & Statistics Packs</p><p id="44bc">20Episode#<b>PySeries</b><a href="https://readmedium.com/seaborn-python-review-9e543b6b7a44">Seaborn Python Review</a> — Reviewing theses Plotting & Statistics Packs</p><figure id="8bf5"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/0*qXEVL-uD3ZpUlWE_.png"><figcaption></figcaption></figure></article></body>

Pandas — Group By

Grouping large amounts of data and compute operations on these groups — #PySeries#Episode 12

print(~Hello Pandas — Group By lesson!”)

Preparing the DataFrame:

import numpy as np
import pandas as pd
data = { 'Company':['GOOG', 'GOOG' 'MSFT', 'MSFT', 'FB', 'FB',
'Person':['Sam', 'Charlie', 'Amy', 'Vanessa', 'Carl', 'Sarah'],'Sales':[200,120, 340, 124, 243, 350]}
df = pd.DataFrame(data)
df

Aggregating Functions:

Allows you to group together rows based off of a column and perform an aggregate function on them.

df.groupby('Company')
byComp = df.groupby('Company')
byComp.mean()
byComp.std()
byComp.sum()
byComp.sum().loc['FB']
Sales    593 Name: FB, dtype: int64

In one single line!

df.groupby('Company').sum().loc['FB']
Sales    593 Name: FB, dtype: int64
df.groupby('Company').count()
df.groupby('Company').max()

We probably use these aggregate functions with the numeric number (here is inverse of alphabet order:)

df.groupby('Company').min()

That will give you a bunch of information!

df.groupby('Company').describe()

df.groupby(‘Company’).describe().transpose()[‘FB’]

Colab File link:)

Credits & References:

Jose Portilla — Python for Data Science and Machine Learning Bootcamp — Learn how to use NumPy, Pandas, Seaborn , Matplotlib , Plotly , Scikit-Learn , Machine Learning, Tensorflow , and more!

Posts Related:

00Episode#PySeries — Python — Jupiter Notebook Quick Start with VSCode — How to Set your Win10 Environment to use Jupiter Notebook

01Episode#PySeries — Python — Python 4 Engineers — Exercises! An overview of the Opportunities Offered by Python in Engineering!

02Episode#PySeries — Python — Geogebra Plus Linear Programming- We’ll Create a Geogebra program to help us with our linear programming

03Episode#PySeries — Python — Python 4 Engineers — More Exercises! — Another Round to Make Sure that Python is Really Amazing!

04Episode#PySeries — Python — Linear Regressions — The Basics — How to Understand Linear Regression Once and For All!

05Episode#PySeries — Python — NumPy Init & Python Review — A Crash Python Review & Initialization at NumPy lib.

06Episode#PySeries — Python — NumPy Arrays & Jupyter Notebook — Arithmetic Operations, Indexing & Slicing, and Conditional Selection w/ np arrays.

07Episode#PySeries — Python — Pandas — Intro & Series — What it is? How to use it?

08Episode#PySeries — Python — Pandas DataFrames — The primary Pandas data structure! It is a dict-like container for Series objects

09Episode#PySeries — Python — Python 4 Engineers — Even More Exercises! — More Practicing Coding Questions in Python!

10Episode#PySeries — Python — Pandas — Hierarchical Index & Cross-section — Open your Colab notebook and here are the follow-up exercises!

11Episode#PySeries — Python — Pandas — Missing Data — Let’s Continue the Python Exercises — Filling & Dropping Missing Data

12Episode#PySeries — Python — Pandas — Group By — Grouping large amounts of data and compute operations on these groups (this one:)

13Episode#PySeries — Python — Pandas — Merging, Joining & Concatenations — Facilities For Easily Combining Together Series or DataFrame

14Episode#PySeries — Python — Pandas — Pandas Dataframe Examples: Column Operations

15Episode#PySeries — Python — Python 4 Engineers — Keeping It In The Short-Term Memory — Test Yourself! Coding in Python, Again!

16Episode#PySeries — NumPy — NumPy Review, Again;) — Python Review Free Exercises

17Episode#PySeriesGenerators in Python — Python Review Free Hints

18Episode#PySeries — Pandas Review…Again;) — Python Review Free Exercise

19Episode#PySeriesMatlibPlot & Seaborn Python Libs — Reviewing theses Plotting & Statistics Packs

20Episode#PySeriesSeaborn Python Review — Reviewing theses Plotting & Statistics Packs

Pandas Dataframe
Pandas
Pandas Groupby
Numpy
Aggregate Function
Recommended from ReadMedium