{
 "cells": [
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "# Adding New Columns\n",
    "\n",
    "In the second chapter we will learn how to add new columns to the tables based on a set of functions. This is similar to the Functions that are available in Excel with _=FUNC(A1)_\n",
    "\n",
    "[![Open In Studio Lab](https://studiolab.sagemaker.aws/studiolab.svg)](https://studiolab.sagemaker.aws/import/github/aiola-lab/from-excel-to-pandas/blob/master/notebooks/02.01_Adding_New_Columns.ipynb)"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "We will start with loading a csv file that is hosted in data.world on the \"Median Value Per Sq ft per zip code in the US\""
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 1,
   "metadata": {},
   "outputs": [],
   "source": [
    "import pandas as pd"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 3,
   "metadata": {},
   "outputs": [],
   "source": [
    "us_median_sq_ft_value = (\n",
    "    pd\n",
    "    .read_csv('https://query.data.world/s/xrfy7fb7oq55gpzh6bvs6jtonv32lk')\n",
    ")"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 4,
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>RegionID</th>\n",
       "      <th>RegionName</th>\n",
       "      <th>City</th>\n",
       "      <th>State</th>\n",
       "      <th>Metro</th>\n",
       "      <th>CountyName</th>\n",
       "      <th>SizeRank</th>\n",
       "      <th>1996-04</th>\n",
       "      <th>1996-05</th>\n",
       "      <th>1996-06</th>\n",
       "      <th>...</th>\n",
       "      <th>2016-12</th>\n",
       "      <th>2017-01</th>\n",
       "      <th>2017-02</th>\n",
       "      <th>2017-03</th>\n",
       "      <th>2017-04</th>\n",
       "      <th>2017-05</th>\n",
       "      <th>2017-06</th>\n",
       "      <th>2017-07</th>\n",
       "      <th>2017-08</th>\n",
       "      <th>2017-09</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>61639</td>\n",
       "      <td>10025</td>\n",
       "      <td>New York</td>\n",
       "      <td>NY</td>\n",
       "      <td>New York</td>\n",
       "      <td>New York</td>\n",
       "      <td>1</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>...</td>\n",
       "      <td>1306.0</td>\n",
       "      <td>1290</td>\n",
       "      <td>1277</td>\n",
       "      <td>1272</td>\n",
       "      <td>1271</td>\n",
       "      <td>1277</td>\n",
       "      <td>1281</td>\n",
       "      <td>1279</td>\n",
       "      <td>1274</td>\n",
       "      <td>1270</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>84654</td>\n",
       "      <td>60657</td>\n",
       "      <td>Chicago</td>\n",
       "      <td>IL</td>\n",
       "      <td>Chicago</td>\n",
       "      <td>Cook</td>\n",
       "      <td>2</td>\n",
       "      <td>134.0</td>\n",
       "      <td>134.0</td>\n",
       "      <td>133.0</td>\n",
       "      <td>...</td>\n",
       "      <td>291.0</td>\n",
       "      <td>294</td>\n",
       "      <td>296</td>\n",
       "      <td>295</td>\n",
       "      <td>293</td>\n",
       "      <td>294</td>\n",
       "      <td>294</td>\n",
       "      <td>294</td>\n",
       "      <td>294</td>\n",
       "      <td>295</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>61637</td>\n",
       "      <td>10023</td>\n",
       "      <td>New York</td>\n",
       "      <td>NY</td>\n",
       "      <td>New York</td>\n",
       "      <td>New York</td>\n",
       "      <td>3</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>...</td>\n",
       "      <td>1602.0</td>\n",
       "      <td>1597</td>\n",
       "      <td>1580</td>\n",
       "      <td>1567</td>\n",
       "      <td>1569</td>\n",
       "      <td>1570</td>\n",
       "      <td>1560</td>\n",
       "      <td>1543</td>\n",
       "      <td>1529</td>\n",
       "      <td>1521</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>84616</td>\n",
       "      <td>60614</td>\n",
       "      <td>Chicago</td>\n",
       "      <td>IL</td>\n",
       "      <td>Chicago</td>\n",
       "      <td>Cook</td>\n",
       "      <td>4</td>\n",
       "      <td>149.0</td>\n",
       "      <td>150.0</td>\n",
       "      <td>150.0</td>\n",
       "      <td>...</td>\n",
       "      <td>330.0</td>\n",
       "      <td>333</td>\n",
       "      <td>333</td>\n",
       "      <td>331</td>\n",
       "      <td>330</td>\n",
       "      <td>330</td>\n",
       "      <td>329</td>\n",
       "      <td>328</td>\n",
       "      <td>329</td>\n",
       "      <td>329</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>93144</td>\n",
       "      <td>79936</td>\n",
       "      <td>El Paso</td>\n",
       "      <td>TX</td>\n",
       "      <td>El Paso</td>\n",
       "      <td>El Paso</td>\n",
       "      <td>5</td>\n",
       "      <td>50.0</td>\n",
       "      <td>51.0</td>\n",
       "      <td>50.0</td>\n",
       "      <td>...</td>\n",
       "      <td>81.0</td>\n",
       "      <td>81</td>\n",
       "      <td>81</td>\n",
       "      <td>81</td>\n",
       "      <td>81</td>\n",
       "      <td>81</td>\n",
       "      <td>82</td>\n",
       "      <td>82</td>\n",
       "      <td>82</td>\n",
       "      <td>82</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "<p>5 rows × 265 columns</p>\n",
       "</div>"
      ],
      "text/plain": [
       "   RegionID  RegionName      City State     Metro CountyName  SizeRank  \\\n",
       "0     61639       10025  New York    NY  New York   New York         1   \n",
       "1     84654       60657   Chicago    IL   Chicago       Cook         2   \n",
       "2     61637       10023  New York    NY  New York   New York         3   \n",
       "3     84616       60614   Chicago    IL   Chicago       Cook         4   \n",
       "4     93144       79936   El Paso    TX   El Paso    El Paso         5   \n",
       "\n",
       "   1996-04  1996-05  1996-06  ...  2016-12  2017-01  2017-02  2017-03  \\\n",
       "0      NaN      NaN      NaN  ...   1306.0     1290     1277     1272   \n",
       "1    134.0    134.0    133.0  ...    291.0      294      296      295   \n",
       "2      NaN      NaN      NaN  ...   1602.0     1597     1580     1567   \n",
       "3    149.0    150.0    150.0  ...    330.0      333      333      331   \n",
       "4     50.0     51.0     50.0  ...     81.0       81       81       81   \n",
       "\n",
       "   2017-04  2017-05  2017-06  2017-07  2017-08  2017-09  \n",
       "0     1271     1277     1281     1279     1274     1270  \n",
       "1      293      294      294      294      294      295  \n",
       "2     1569     1570     1560     1543     1529     1521  \n",
       "3      330      330      329      328      329      329  \n",
       "4       81       81       82       82       82       82  \n",
       "\n",
       "[5 rows x 265 columns]"
      ]
     },
     "execution_count": 4,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "us_median_sq_ft_value.head()"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "## Constant Value\n",
    "\n",
    "The simplest way to add a column is to put a constant value. Since we know that this data is only for USA we can add a column with the value _USA_ to a new column called _country_. We will be able to later merge this table with data from other countries and then this new column will be useful. \n",
    "\n",
    "For setting a value in a column we will use the [assign](https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.DataFrame.assign.html) function of Pandas. You will need to scroll all the way to the right of the output to see the new _country_ column."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 5,
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>RegionID</th>\n",
       "      <th>RegionName</th>\n",
       "      <th>City</th>\n",
       "      <th>State</th>\n",
       "      <th>Metro</th>\n",
       "      <th>CountyName</th>\n",
       "      <th>SizeRank</th>\n",
       "      <th>1996-04</th>\n",
       "      <th>1996-05</th>\n",
       "      <th>1996-06</th>\n",
       "      <th>...</th>\n",
       "      <th>2017-01</th>\n",
       "      <th>2017-02</th>\n",
       "      <th>2017-03</th>\n",
       "      <th>2017-04</th>\n",
       "      <th>2017-05</th>\n",
       "      <th>2017-06</th>\n",
       "      <th>2017-07</th>\n",
       "      <th>2017-08</th>\n",
       "      <th>2017-09</th>\n",
       "      <th>country</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>61639</td>\n",
       "      <td>10025</td>\n",
       "      <td>New York</td>\n",
       "      <td>NY</td>\n",
       "      <td>New York</td>\n",
       "      <td>New York</td>\n",
       "      <td>1</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>...</td>\n",
       "      <td>1290</td>\n",
       "      <td>1277</td>\n",
       "      <td>1272</td>\n",
       "      <td>1271</td>\n",
       "      <td>1277</td>\n",
       "      <td>1281</td>\n",
       "      <td>1279</td>\n",
       "      <td>1274</td>\n",
       "      <td>1270</td>\n",
       "      <td>USA</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>84654</td>\n",
       "      <td>60657</td>\n",
       "      <td>Chicago</td>\n",
       "      <td>IL</td>\n",
       "      <td>Chicago</td>\n",
       "      <td>Cook</td>\n",
       "      <td>2</td>\n",
       "      <td>134.0</td>\n",
       "      <td>134.0</td>\n",
       "      <td>133.0</td>\n",
       "      <td>...</td>\n",
       "      <td>294</td>\n",
       "      <td>296</td>\n",
       "      <td>295</td>\n",
       "      <td>293</td>\n",
       "      <td>294</td>\n",
       "      <td>294</td>\n",
       "      <td>294</td>\n",
       "      <td>294</td>\n",
       "      <td>295</td>\n",
       "      <td>USA</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>61637</td>\n",
       "      <td>10023</td>\n",
       "      <td>New York</td>\n",
       "      <td>NY</td>\n",
       "      <td>New York</td>\n",
       "      <td>New York</td>\n",
       "      <td>3</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>...</td>\n",
       "      <td>1597</td>\n",
       "      <td>1580</td>\n",
       "      <td>1567</td>\n",
       "      <td>1569</td>\n",
       "      <td>1570</td>\n",
       "      <td>1560</td>\n",
       "      <td>1543</td>\n",
       "      <td>1529</td>\n",
       "      <td>1521</td>\n",
       "      <td>USA</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>84616</td>\n",
       "      <td>60614</td>\n",
       "      <td>Chicago</td>\n",
       "      <td>IL</td>\n",
       "      <td>Chicago</td>\n",
       "      <td>Cook</td>\n",
       "      <td>4</td>\n",
       "      <td>149.0</td>\n",
       "      <td>150.0</td>\n",
       "      <td>150.0</td>\n",
       "      <td>...</td>\n",
       "      <td>333</td>\n",
       "      <td>333</td>\n",
       "      <td>331</td>\n",
       "      <td>330</td>\n",
       "      <td>330</td>\n",
       "      <td>329</td>\n",
       "      <td>328</td>\n",
       "      <td>329</td>\n",
       "      <td>329</td>\n",
       "      <td>USA</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>93144</td>\n",
       "      <td>79936</td>\n",
       "      <td>El Paso</td>\n",
       "      <td>TX</td>\n",
       "      <td>El Paso</td>\n",
       "      <td>El Paso</td>\n",
       "      <td>5</td>\n",
       "      <td>50.0</td>\n",
       "      <td>51.0</td>\n",
       "      <td>50.0</td>\n",
       "      <td>...</td>\n",
       "      <td>81</td>\n",
       "      <td>81</td>\n",
       "      <td>81</td>\n",
       "      <td>81</td>\n",
       "      <td>81</td>\n",
       "      <td>82</td>\n",
       "      <td>82</td>\n",
       "      <td>82</td>\n",
       "      <td>82</td>\n",
       "      <td>USA</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>...</th>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>14750</th>\n",
       "      <td>73940</td>\n",
       "      <td>36564</td>\n",
       "      <td>Fairhope</td>\n",
       "      <td>AL</td>\n",
       "      <td>Daphne</td>\n",
       "      <td>Baldwin</td>\n",
       "      <td>14751</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>...</td>\n",
       "      <td>230</td>\n",
       "      <td>238</td>\n",
       "      <td>243</td>\n",
       "      <td>245</td>\n",
       "      <td>244</td>\n",
       "      <td>245</td>\n",
       "      <td>246</td>\n",
       "      <td>244</td>\n",
       "      <td>240</td>\n",
       "      <td>USA</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>14751</th>\n",
       "      <td>59107</td>\n",
       "      <td>3293</td>\n",
       "      <td>Woodstock</td>\n",
       "      <td>NH</td>\n",
       "      <td>Claremont</td>\n",
       "      <td>Grafton</td>\n",
       "      <td>14752</td>\n",
       "      <td>58.0</td>\n",
       "      <td>59.0</td>\n",
       "      <td>61.0</td>\n",
       "      <td>...</td>\n",
       "      <td>129</td>\n",
       "      <td>129</td>\n",
       "      <td>128</td>\n",
       "      <td>129</td>\n",
       "      <td>130</td>\n",
       "      <td>130</td>\n",
       "      <td>130</td>\n",
       "      <td>130</td>\n",
       "      <td>131</td>\n",
       "      <td>USA</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>14752</th>\n",
       "      <td>82396</td>\n",
       "      <td>55713</td>\n",
       "      <td>Buhl</td>\n",
       "      <td>MN</td>\n",
       "      <td>Duluth</td>\n",
       "      <td>Saint Louis</td>\n",
       "      <td>14753</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>...</td>\n",
       "      <td>72</td>\n",
       "      <td>73</td>\n",
       "      <td>74</td>\n",
       "      <td>74</td>\n",
       "      <td>73</td>\n",
       "      <td>73</td>\n",
       "      <td>74</td>\n",
       "      <td>77</td>\n",
       "      <td>79</td>\n",
       "      <td>USA</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>14753</th>\n",
       "      <td>66881</td>\n",
       "      <td>21405</td>\n",
       "      <td>Annapolis</td>\n",
       "      <td>MD</td>\n",
       "      <td>Baltimore</td>\n",
       "      <td>Anne Arundel</td>\n",
       "      <td>14754</td>\n",
       "      <td>146.0</td>\n",
       "      <td>149.0</td>\n",
       "      <td>151.0</td>\n",
       "      <td>...</td>\n",
       "      <td>470</td>\n",
       "      <td>462</td>\n",
       "      <td>457</td>\n",
       "      <td>457</td>\n",
       "      <td>455</td>\n",
       "      <td>454</td>\n",
       "      <td>455</td>\n",
       "      <td>458</td>\n",
       "      <td>460</td>\n",
       "      <td>USA</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>14754</th>\n",
       "      <td>94815</td>\n",
       "      <td>85220</td>\n",
       "      <td>Apache Junction</td>\n",
       "      <td>AZ</td>\n",
       "      <td>Phoenix</td>\n",
       "      <td>Pinal</td>\n",
       "      <td>14755</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>...</td>\n",
       "      <td>117</td>\n",
       "      <td>119</td>\n",
       "      <td>119</td>\n",
       "      <td>119</td>\n",
       "      <td>120</td>\n",
       "      <td>121</td>\n",
       "      <td>122</td>\n",
       "      <td>124</td>\n",
       "      <td>124</td>\n",
       "      <td>USA</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "<p>14755 rows × 266 columns</p>\n",
       "</div>"
      ],
      "text/plain": [
       "       RegionID  RegionName             City State      Metro    CountyName  \\\n",
       "0         61639       10025         New York    NY   New York      New York   \n",
       "1         84654       60657          Chicago    IL    Chicago          Cook   \n",
       "2         61637       10023         New York    NY   New York      New York   \n",
       "3         84616       60614          Chicago    IL    Chicago          Cook   \n",
       "4         93144       79936          El Paso    TX    El Paso       El Paso   \n",
       "...         ...         ...              ...   ...        ...           ...   \n",
       "14750     73940       36564         Fairhope    AL     Daphne       Baldwin   \n",
       "14751     59107        3293        Woodstock    NH  Claremont       Grafton   \n",
       "14752     82396       55713             Buhl    MN     Duluth   Saint Louis   \n",
       "14753     66881       21405        Annapolis    MD  Baltimore  Anne Arundel   \n",
       "14754     94815       85220  Apache Junction    AZ    Phoenix         Pinal   \n",
       "\n",
       "       SizeRank  1996-04  1996-05  1996-06  ...  2017-01  2017-02  2017-03  \\\n",
       "0             1      NaN      NaN      NaN  ...     1290     1277     1272   \n",
       "1             2    134.0    134.0    133.0  ...      294      296      295   \n",
       "2             3      NaN      NaN      NaN  ...     1597     1580     1567   \n",
       "3             4    149.0    150.0    150.0  ...      333      333      331   \n",
       "4             5     50.0     51.0     50.0  ...       81       81       81   \n",
       "...         ...      ...      ...      ...  ...      ...      ...      ...   \n",
       "14750     14751      NaN      NaN      NaN  ...      230      238      243   \n",
       "14751     14752     58.0     59.0     61.0  ...      129      129      128   \n",
       "14752     14753      NaN      NaN      NaN  ...       72       73       74   \n",
       "14753     14754    146.0    149.0    151.0  ...      470      462      457   \n",
       "14754     14755      NaN      NaN      NaN  ...      117      119      119   \n",
       "\n",
       "       2017-04  2017-05  2017-06  2017-07  2017-08  2017-09  country  \n",
       "0         1271     1277     1281     1279     1274     1270      USA  \n",
       "1          293      294      294      294      294      295      USA  \n",
       "2         1569     1570     1560     1543     1529     1521      USA  \n",
       "3          330      330      329      328      329      329      USA  \n",
       "4           81       81       82       82       82       82      USA  \n",
       "...        ...      ...      ...      ...      ...      ...      ...  \n",
       "14750      245      244      245      246      244      240      USA  \n",
       "14751      129      130      130      130      130      131      USA  \n",
       "14752       74       73       73       74       77       79      USA  \n",
       "14753      457      455      454      455      458      460      USA  \n",
       "14754      119      120      121      122      124      124      USA  \n",
       "\n",
       "[14755 rows x 266 columns]"
      ]
     },
     "execution_count": 5,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "(\n",
    "    us_median_sq_ft_value\n",
    "    .assign(country=\"USA\")\n",
    ")"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "## Excel Functions Equivalent\n",
    "\n",
    "Most of the functions that you are used to use in Excel have a direct Equivalent in Pands. We will try a few of them in the following examples. We will only look at the _City_ column in the dataset.\n",
    "\n",
    "For example: ```=TRIM('cell for trimming')``` can be replaced by _strip_. \n",
    "We will:\n",
    "* Take only the _City_ column from the table \n",
    "* Add a column that strips spaces from around the string (_str_) of the _City_ column\n",
    "* Show the head (first 5 lines) of the new sub-table"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 7,
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>City</th>\n",
       "      <th>trimmed</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>New York</td>\n",
       "      <td>New York</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>Chicago</td>\n",
       "      <td>Chicago</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>New York</td>\n",
       "      <td>New York</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>Chicago</td>\n",
       "      <td>Chicago</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>El Paso</td>\n",
       "      <td>El Paso</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "       City   trimmed\n",
       "0  New York  New York\n",
       "1   Chicago   Chicago\n",
       "2  New York  New York\n",
       "3   Chicago   Chicago\n",
       "4   El Paso   El Paso"
      ]
     },
     "execution_count": 7,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "(\n",
    "    us_median_sq_ft_value[['City']]\n",
    "    .assign(trimmed = us_median_sq_ft_value.City.str.strip())\n",
    "    .head()\n",
    ")"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "or to remove spaces between words, you will need to run 'Find and Replace' option in Excel and in Pandas we can use the _replace_ function:\n",
    "* Take only the _City_ column from the table \n",
    "* Add a column that is removes every space (' ') from the string of the _City_ column\n",
    "* Show the head of the new sub-table"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 8,
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>City</th>\n",
       "      <th>no_space</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>New York</td>\n",
       "      <td>NewYork</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>Chicago</td>\n",
       "      <td>Chicago</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>New York</td>\n",
       "      <td>NewYork</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>Chicago</td>\n",
       "      <td>Chicago</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>El Paso</td>\n",
       "      <td>ElPaso</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "       City no_space\n",
       "0  New York  NewYork\n",
       "1   Chicago  Chicago\n",
       "2  New York  NewYork\n",
       "3   Chicago  Chicago\n",
       "4   El Paso   ElPaso"
      ]
     },
     "execution_count": 8,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "(\n",
    "    us_median_sq_ft_value[['City']]\n",
    "    .assign(no_space = us_median_sq_ft_value.City.str.replace(' ',''))\n",
    "    .head()\n",
    ")"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "### More Textual Functions\n",
    "\n",
    "Another commonly used Excel functions are LEFT and RIGHT (```=LEFT(Cell, number of digits)```). In Pandas, we can use the '```:```', which is used for _until_ or _from_:\n",
    "* Take only the _City_ column from the table \n",
    "* Add a column that takes the first 5 characters (=LEFT) from the string of the _City_ column\n",
    "* Add a column that takes the last 5 characters (=RIGHT) from the string of the _City_ column\n",
    "* Show the head of the new sub-table\n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 9,
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>City</th>\n",
       "      <th>city_left</th>\n",
       "      <th>city_right</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>New York</td>\n",
       "      <td>New Y</td>\n",
       "      <td>York</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>Chicago</td>\n",
       "      <td>Chica</td>\n",
       "      <td>icago</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>New York</td>\n",
       "      <td>New Y</td>\n",
       "      <td>York</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>Chicago</td>\n",
       "      <td>Chica</td>\n",
       "      <td>icago</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>El Paso</td>\n",
       "      <td>El Pa</td>\n",
       "      <td>Paso</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "       City city_left city_right\n",
       "0  New York     New Y       York\n",
       "1   Chicago     Chica      icago\n",
       "2  New York     New Y       York\n",
       "3   Chicago     Chica      icago\n",
       "4   El Paso     El Pa       Paso"
      ]
     },
     "execution_count": 9,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "(\n",
    "    us_median_sq_ft_value[['City']]\n",
    "    .assign(city_left = us_median_sq_ft_value.City.str[:5]) # until the 5th character\n",
    "    .assign(city_right = us_median_sq_ft_value.City.str[-5:]) # from the 5th characters from the end (=right)\n",
    "    .head()\n",
    ")"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "## Using Lambda\n",
    "\n",
    "Lambda is a bit confusing at first glance, however, it is used to define a the function that we want to apply on a each row or columns of the table. For example, let's define two new columns: one that is calculating the quantile of the region compare to all the regions, and one that is checking if the state is one of ['NY','NJ']\n",
    "\n",
    "We will focus only on the _State_ and the last month in the data and add the two new columns to it.\n",
    "* Take only the _State_ and the values of the last period columns from the table \n",
    "* Add a column that checks if the _State_ is either _'NY'_ or _'NJ'_\n",
    "* Add a column that calculates the ratio of the average price of the row to the maximum price (quantile)"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 10,
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>State</th>\n",
       "      <th>2017-09</th>\n",
       "      <th>is_NY_or_NJ</th>\n",
       "      <th>last_quantile</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>NY</td>\n",
       "      <td>1270</td>\n",
       "      <td>True</td>\n",
       "      <td>0.702434</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>IL</td>\n",
       "      <td>295</td>\n",
       "      <td>False</td>\n",
       "      <td>0.163164</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>NY</td>\n",
       "      <td>1521</td>\n",
       "      <td>True</td>\n",
       "      <td>0.841261</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>IL</td>\n",
       "      <td>329</td>\n",
       "      <td>False</td>\n",
       "      <td>0.181969</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>TX</td>\n",
       "      <td>82</td>\n",
       "      <td>False</td>\n",
       "      <td>0.045354</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>...</th>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>14750</th>\n",
       "      <td>AL</td>\n",
       "      <td>240</td>\n",
       "      <td>False</td>\n",
       "      <td>0.132743</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>14751</th>\n",
       "      <td>NH</td>\n",
       "      <td>131</td>\n",
       "      <td>False</td>\n",
       "      <td>0.072456</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>14752</th>\n",
       "      <td>MN</td>\n",
       "      <td>79</td>\n",
       "      <td>False</td>\n",
       "      <td>0.043695</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>14753</th>\n",
       "      <td>MD</td>\n",
       "      <td>460</td>\n",
       "      <td>False</td>\n",
       "      <td>0.254425</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>14754</th>\n",
       "      <td>AZ</td>\n",
       "      <td>124</td>\n",
       "      <td>False</td>\n",
       "      <td>0.068584</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "<p>14755 rows × 4 columns</p>\n",
       "</div>"
      ],
      "text/plain": [
       "      State  2017-09  is_NY_or_NJ  last_quantile\n",
       "0        NY     1270         True       0.702434\n",
       "1        IL      295        False       0.163164\n",
       "2        NY     1521         True       0.841261\n",
       "3        IL      329        False       0.181969\n",
       "4        TX       82        False       0.045354\n",
       "...     ...      ...          ...            ...\n",
       "14750    AL      240        False       0.132743\n",
       "14751    NH      131        False       0.072456\n",
       "14752    MN       79        False       0.043695\n",
       "14753    MD      460        False       0.254425\n",
       "14754    AZ      124        False       0.068584\n",
       "\n",
       "[14755 rows x 4 columns]"
      ]
     },
     "execution_count": 10,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "(\n",
    "    us_median_sq_ft_value[['State','2017-09']]\n",
    "    .assign(is_NY_or_NJ=lambda x : x.State.isin(['NY','NJ']))\n",
    "    .assign(last_quantile=lambda x : x['2017-09']/max(x['2017-09']))    \n",
    ")"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "## Multiple Data Sources\n",
    "\n",
    "We can do a more complicated analysis that is based on additional external sources. For example, we want to calculate the differences between red states and blue states. We can then ask various questions regarding the impacts or correlation between the house prices and the election results and voting patterns. \n",
    "\n",
    "First we will load data regarding the voting in various states across the years. A quick internet search finds an interesting table in the following Wikipedia page. Since there are multiple tables on that page, we can filter them using the _match_ option, and using the word _Year_ as the filter.\n",
    "* Read the HTML tables in the wikipedia entry on the topic of red and blue states\n",
    "* retrive on the ones that have the word 'Year'"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 11,
   "metadata": {},
   "outputs": [],
   "source": [
    "red_blue_states_wikipedia_entry = 'https://en.wikipedia.org/wiki/Red_states_and_blue_states'\n",
    "wikipedia_page_tables = (\n",
    "    pd\n",
    "    .read_html(\n",
    "        red_blue_states_wikipedia_entry, \n",
    "        match='Year'\n",
    "    )\n",
    ")"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "As we saw in the loading data from web site, the list can includes multiple tables. However, we filtered it and only have one table, and we can access it with the _[0]_ modifier."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 12,
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead tr th {\n",
       "        text-align: left;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr>\n",
       "      <th></th>\n",
       "      <th>Year</th>\n",
       "      <th>1972</th>\n",
       "      <th>1976</th>\n",
       "      <th>1980</th>\n",
       "      <th>1984</th>\n",
       "      <th>1988</th>\n",
       "      <th>1992</th>\n",
       "      <th>1996</th>\n",
       "      <th>2000</th>\n",
       "      <th>2004</th>\n",
       "      <th>2008</th>\n",
       "      <th>2012</th>\n",
       "      <th>2016</th>\n",
       "      <th>2020</th>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th></th>\n",
       "      <th>Democratic Republican (lighter shading indicates win ≤5%)  Winner did not receive a majority of the popular vote  Winner did not receive a majority of the popular vote and lost the popular vote  Winner chosen by the House of Representatives  Electoral votes invalidated</th>\n",
       "      <th>Democratic Republican (lighter shading indicates win ≤5%)  Winner did not receive a majority of the popular vote  Winner did not receive a majority of the popular vote and lost the popular vote  Winner chosen by the House of Representatives  Electoral votes invalidated</th>\n",
       "      <th>Democratic Republican (lighter shading indicates win ≤5%)  Winner did not receive a majority of the popular vote  Winner did not receive a majority of the popular vote and lost the popular vote  Winner chosen by the House of Representatives  Electoral votes invalidated</th>\n",
       "      <th>Democratic Republican (lighter shading indicates win ≤5%)  Winner did not receive a majority of the popular vote  Winner did not receive a majority of the popular vote and lost the popular vote  Winner chosen by the House of Representatives  Electoral votes invalidated</th>\n",
       "      <th>Democratic Republican (lighter shading indicates win ≤5%)  Winner did not receive a majority of the popular vote  Winner did not receive a majority of the popular vote and lost the popular vote  Winner chosen by the House of Representatives  Electoral votes invalidated</th>\n",
       "      <th>Democratic Republican (lighter shading indicates win ≤5%)  Winner did not receive a majority of the popular vote  Winner did not receive a majority of the popular vote and lost the popular vote  Winner chosen by the House of Representatives  Electoral votes invalidated</th>\n",
       "      <th>Democratic Republican (lighter shading indicates win ≤5%)  Winner did not receive a majority of the popular vote  Winner did not receive a majority of the popular vote and lost the popular vote  Winner chosen by the House of Representatives  Electoral votes invalidated</th>\n",
       "      <th>Democratic Republican (lighter shading indicates win ≤5%)  Winner did not receive a majority of the popular vote  Winner did not receive a majority of the popular vote and lost the popular vote  Winner chosen by the House of Representatives  Electoral votes invalidated</th>\n",
       "      <th>Democratic Republican (lighter shading indicates win ≤5%)  Winner did not receive a majority of the popular vote  Winner did not receive a majority of the popular vote and lost the popular vote  Winner chosen by the House of Representatives  Electoral votes invalidated</th>\n",
       "      <th>Democratic Republican (lighter shading indicates win ≤5%)  Winner did not receive a majority of the popular vote  Winner did not receive a majority of the popular vote and lost the popular vote  Winner chosen by the House of Representatives  Electoral votes invalidated</th>\n",
       "      <th>Democratic Republican (lighter shading indicates win ≤5%)  Winner did not receive a majority of the popular vote  Winner did not receive a majority of the popular vote and lost the popular vote  Winner chosen by the House of Representatives  Electoral votes invalidated</th>\n",
       "      <th>Democratic Republican (lighter shading indicates win ≤5%)  Winner did not receive a majority of the popular vote  Winner did not receive a majority of the popular vote and lost the popular vote  Winner chosen by the House of Representatives  Electoral votes invalidated</th>\n",
       "      <th>Democratic Republican (lighter shading indicates win ≤5%)  Winner did not receive a majority of the popular vote  Winner did not receive a majority of the popular vote and lost the popular vote  Winner chosen by the House of Representatives  Electoral votes invalidated</th>\n",
       "      <th>Democratic Republican (lighter shading indicates win ≤5%)  Winner did not receive a majority of the popular vote  Winner did not receive a majority of the popular vote and lost the popular vote  Winner chosen by the House of Representatives  Electoral votes invalidated</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>Democratic candidate</td>\n",
       "      <td>George McGovern</td>\n",
       "      <td>Jimmy Carter</td>\n",
       "      <td>Jimmy Carter</td>\n",
       "      <td>Walter Mondale</td>\n",
       "      <td>Michael Dukakis</td>\n",
       "      <td>Bill Clinton</td>\n",
       "      <td>Bill Clinton</td>\n",
       "      <td>Al Gore</td>\n",
       "      <td>John Kerry</td>\n",
       "      <td>Barack Obama</td>\n",
       "      <td>Barack Obama</td>\n",
       "      <td>Hillary Clinton</td>\n",
       "      <td>Joe Biden</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>Republican candidate</td>\n",
       "      <td>Richard Nixon</td>\n",
       "      <td>Gerald Ford</td>\n",
       "      <td>Ronald Reagan</td>\n",
       "      <td>Ronald Reagan</td>\n",
       "      <td>George H. W. Bush</td>\n",
       "      <td>George H. W. Bush</td>\n",
       "      <td>Bob Dole</td>\n",
       "      <td>George W. Bush</td>\n",
       "      <td>George W. Bush</td>\n",
       "      <td>John McCain</td>\n",
       "      <td>Mitt Romney</td>\n",
       "      <td>Donald Trump</td>\n",
       "      <td>Donald Trump</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>National popular vote</td>\n",
       "      <td>Nixon</td>\n",
       "      <td>Carter</td>\n",
       "      <td>Reagan</td>\n",
       "      <td>Reagan</td>\n",
       "      <td>Bush</td>\n",
       "      <td>Clinton</td>\n",
       "      <td>Clinton</td>\n",
       "      <td>Gore</td>\n",
       "      <td>Bush</td>\n",
       "      <td>Obama</td>\n",
       "      <td>Obama</td>\n",
       "      <td>Clinton</td>\n",
       "      <td>Biden</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>Alabama</td>\n",
       "      <td>Nixon</td>\n",
       "      <td>Carter</td>\n",
       "      <td>Reagan</td>\n",
       "      <td>Reagan</td>\n",
       "      <td>Bush</td>\n",
       "      <td>Bush</td>\n",
       "      <td>Dole</td>\n",
       "      <td>Bush</td>\n",
       "      <td>Bush</td>\n",
       "      <td>McCain</td>\n",
       "      <td>Romney</td>\n",
       "      <td>Trump</td>\n",
       "      <td>Trump</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>Alaska</td>\n",
       "      <td>Nixon</td>\n",
       "      <td>Ford</td>\n",
       "      <td>Reagan</td>\n",
       "      <td>Reagan</td>\n",
       "      <td>Bush</td>\n",
       "      <td>Bush</td>\n",
       "      <td>Dole</td>\n",
       "      <td>Bush</td>\n",
       "      <td>Bush</td>\n",
       "      <td>McCain</td>\n",
       "      <td>Romney</td>\n",
       "      <td>Trump</td>\n",
       "      <td>Trump</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "                                                                                                                                                                                                     \n",
       "3                                             Reagan                                                                                                                                                                                                                              \n",
       "4                                             Reagan                                                                                                                                                                                                                              \n",
       "\n",
       "                                                                                                                                                                                                                                                                           1984  \\\n",
       "  Democratic Republican (lighter shading indicates win ≤5%)  Winner did not receive a majority of the popular vote  Winner did not receive a majority of the popular vote and lost the popular vote  Winner chosen by the House of Representatives  Electoral votes invalidated   \n",
       "0                                     Walter Mondale                                                                                                                                                                                                                              \n",
       "1                                      Ronald Reagan                                                                                                                                                                                                                              \n",
       "2                                             Reagan                                                                                                                                                                                                                              \n",
       "3                                             Reagan                                                                                                                                                                                                                              \n",
       "4                                             Reagan                                                                                                                                                                                                                              \n",
       "\n",
       "                                                                                                                                                                                                                                                                           1988  \\\n",
       "  Democratic Republican (lighter shading indicates win ≤5%)  Winner did not receive a majority of the popular vote  Winner did not receive a majority of the popular vote and lost the popular vote  Winner chosen by the House of Representatives  Electoral votes invalidated   \n",
       "0                                    Michael Dukakis                                                                                                                                                                                                                              \n",
       "1                                  George H. W. Bush                                                                                                                                                                                                                              \n",
       "2                                               Bush                                                                                                                                                                                                                              \n",
       "3                                               Bush                                                                                                                                                                                                                              \n",
       "4                                               Bush                                                                                                                                                                                                                              \n",
       "\n",
       "                                                                                                                                                                                                                                                                           1992  \\\n",
       "  Democratic Republican (lighter shading indicates win ≤5%)  Winner did not receive a majority of the popular vote  Winner did not receive a majority of the popular vote and lost the popular vote  Winner chosen by the House of Representatives  Electoral votes invalidated   \n",
       "0                                       Bill Clinton                                                                                                                                                                                                                              \n",
       "1                                  George H. W. Bush                                                                                                                                                                                                                              \n",
       "2                                            Clinton                                                                                                                                                                                                                              \n",
       "3                                               Bush                                                                                                                                                                                                                              \n",
       "4                                               Bush                                                                                                                                                                                                                              \n",
       "\n",
       "                                                                                                                                                                                                                                                                           1996  \\\n",
       "  Democratic Republican (lighter shading indicates win ≤5%)  Winner did not receive a majority of the popular vote  Winner did not receive a majority of the popular vote and lost the popular vote  Winner chosen by the House of Representatives  Electoral votes invalidated   \n",
       "0                                       Bill Clinton                                                                                                                                                                                                                              \n",
       "1                                           Bob Dole                                                                                                                                                                                                                              \n",
       "2                                            Clinton                                                                                                                                                                                                                              \n",
       "3                                               Dole                                                                                                                                                                                                                              \n",
       "4                                               Dole                                                                                                                                                                                                                              \n",
       "\n",
       "                                                                                                                                                                                                                                                                           2000  \\\n",
       "  Democratic Republican (lighter shading indicates win ≤5%)  Winner did not receive a majority of the popular vote  Winner did not receive a majority of the popular vote and lost the popular vote  Winner chosen by the House of Representatives  Electoral votes invalidated   \n",
       "0                                            Al Gore                                                                                                                                                                                                                              \n",
       "1                                     George W. Bush                                                                                                                                                                                                                              \n",
       "2                                               Gore                                                                                                                                                                                                                              \n",
       "3                                               Bush                                                                                                                                                                                                                              \n",
       "4                                               Bush                                                                                                                                                                                                                              \n",
       "\n",
       "                                                                                                                                                                                                                                                                           2004  \\\n",
       "  Democratic Republican (lighter shading indicates win ≤5%)  Winner did not receive a majority of the popular vote  Winner did not receive a majority of the popular vote and lost the popular vote  Winner chosen by the House of Representatives  Electoral votes invalidated   \n",
       "0                                         John Kerry                                                                                                                                                                                                                              \n",
       "1                                     George W. Bush                                                                                                                                                                                                                              \n",
       "2                                               Bush                                                                                                                                                                                                                              \n",
       "3                                               Bush                                                                                                                                                                                                                              \n",
       "4                                               Bush                                                                                                                                                                                                                              \n",
       "\n",
       "                                                                                                                                                                                                                                                                           2008  \\\n",
       "  Democratic Republican (lighter shading indicates win ≤5%)  Winner did not receive a majority of the popular vote  Winner did not receive a majority of the popular vote and lost the popular vote  Winner chosen by the House of Representatives  Electoral votes invalidated   \n",
       "0                                       Barack Obama                                                                                                                                                                                                                              \n",
       "1                                        John McCain                                                                                                                                                                                                                              \n",
       "2                                              Obama                                                                                                                                                                                                                              \n",
       "3                                             McCain                                                                                                                                                                                                                              \n",
       "4                                             McCain                                                                                                                                                                                                                              \n",
       "\n",
       "                                                                                                                                                                                                                                                                           2012  \\\n",
       "  Democratic Republican (lighter shading indicates win ≤5%)  Winner did not receive a majority of the popular vote  Winner did not receive a majority of the popular vote and lost the popular vote  Winner chosen by the House of Representatives  Electoral votes invalidated   \n",
       "0                                       Barack Obama                                                                                                                                                                                                                              \n",
       "1                                        Mitt Romney                                                                                                                                                                                                                              \n",
       "2                                              Obama                                                                                                                                                                                                                              \n",
       "3                                             Romney                                                                                                                                                                                                                              \n",
       "4                                             Romney                                                                                                                                                                                                                              \n",
       "\n",
       "                                                                                                                                                                                                                                                                           2016  \\\n",
       "  Democratic Republican (lighter shading indicates win ≤5%)  Winner did not receive a majority of the popular vote  Winner did not receive a majority of the popular vote and lost the popular vote  Winner chosen by the House of Representatives  Electoral votes invalidated   \n",
       "0                                    Hillary Clinton                                                                                                                                                                                                                              \n",
       "1                                       Donald Trump                                                                                                                                                                                                                              \n",
       "2                                            Clinton                                                                                                                                                                                                                              \n",
       "3                                              Trump                                                                                                                                                                                                                              \n",
       "4                                              Trump                                                                                                                                                                                                                              \n",
       "\n",
       "                                                                                                                                                                                                                                                                           2020  \n",
       "  Democratic Republican (lighter shading indicates win ≤5%)  Winner did not receive a majority of the popular vote  Winner did not receive a majority of the popular vote and lost the popular vote  Winner chosen by the House of Representatives  Electoral votes invalidated  \n",
       "0                                          Joe Biden                                                                                                                                                                                                                             \n",
       "1                                       Donald Trump                                                                                                                                                                                                                             \n",
       "2                                              Biden                                                                                                                                                                                                                             \n",
       "3                                              Trump                                                                                                                                                                                                                             \n",
       "4                                              Trump                                                                                                                                                                                                                             "
      ]
     },
     "execution_count": 12,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "red_blue_states = wikipedia_page_tables[0]\n",
    "red_blue_states.head()"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "We will start with creating a new list of states that voted for a specific candidate in a specific electio. First, let's use Trump in the 2016 elections. \n",
    "\n",
    "We are using here a few new functions such as [rename](https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.DataFrame.rename.html), [query](https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.DataFrame.query.html) and [iloc](https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.DataFrame.iloc.html), which we will discuss in more details in the next sections."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 13,
   "metadata": {},
   "outputs": [],
   "source": [
    "red_candidate = 'Trump'"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "We will:\n",
    "* Take the large table of the multiple elections results\n",
    "* Remove the complicated multi level header from the table (_droplevel_)\n",
    "* Filter to only states that have the name of the red candidate, we defined above, in the column of the election year (_2020_, for example). Note, we need to surround the name of the column with ` as it starts with a number\n",
    "* Take only the first column from the result table, which holds the name of the state (_iloc[:,0]_). "
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 14,
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
       "3            Alabama\n",
       "4             Alaska\n",
       "6           Arkansas\n",
       "12           Florida\n",
       "15             Idaho\n",
       "17           Indiana\n",
       "18              Iowa\n",
       "19            Kansas\n",
       "20          Kentucky\n",
       "21         Louisiana\n",
       "29       Mississippi\n",
       "30          Missouri\n",
       "31           Montana\n",
       "39    North Carolina\n",
       "40      North Dakota\n",
       "41              Ohio\n",
       "42          Oklahoma\n",
       "46    South Carolina\n",
       "47      South Dakota\n",
       "48         Tennessee\n",
       "49             Texas\n",
       "50              Utah\n",
       "54     West Virginia\n",
       "56           Wyoming\n",
       "Name: Year, dtype: object"
      ]
     },
     "execution_count": 14,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "red_states_2020 = (\n",
    "    red_blue_states\n",
    "    .droplevel(1, axis=1) \n",
    "    .query(\"`2020` == @red_candidate\")\n",
    "    .iloc[:,0]\n",
    ")\n",
    "red_states_2020"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "Now that we have the list of the red states of 2020, we need to match them to the states that we have in our house prices data set. We see that we have to match the short version in the data set with the long version in the red state list. \n",
    "\n",
    "We will use a another option to create a data frame using a Dictionary. It is not hard to find a list of mapping of states names and abbreviations. "
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 15,
   "metadata": {},
   "outputs": [],
   "source": [
    "us_state_abbrev = {\n",
    "    'Alabama': 'AL',\n",
    "    'Alaska': 'AK',\n",
    "    'American Samoa': 'AS',\n",
    "    'Arizona': 'AZ',\n",
    "    'Arkansas': 'AR',\n",
    "    'California': 'CA',\n",
    "    'Colorado': 'CO',\n",
    "    'Connecticut': 'CT',\n",
    "    'Delaware': 'DE',\n",
    "    'District of Columbia': 'DC',\n",
    "    'Florida': 'FL',\n",
    "    'Georgia': 'GA',\n",
    "    'Guam': 'GU',\n",
    "    'Hawaii': 'HI',\n",
    "    'Idaho': 'ID',\n",
    "    'Illinois': 'IL',\n",
    "    'Indiana': 'IN',\n",
    "    'Iowa': 'IA',\n",
    "    'Kansas': 'KS',\n",
    "    'Kentucky': 'KY',\n",
    "    'Louisiana': 'LA',\n",
    "    'Maine': 'ME',\n",
    "    'Maryland': 'MD',\n",
    "    'Massachusetts': 'MA',\n",
    "    'Michigan': 'MI',\n",
    "    'Minnesota': 'MN',\n",
    "    'Mississippi': 'MS',\n",
    "    'Missouri': 'MO',\n",
    "    'Montana': 'MT',\n",
    "    'Nebraska': 'NE',\n",
    "    'Nevada': 'NV',\n",
    "    'New Hampshire': 'NH',\n",
    "    'New Jersey': 'NJ',\n",
    "    'New Mexico': 'NM',\n",
    "    'New York': 'NY',\n",
    "    'North Carolina': 'NC',\n",
    "    'North Dakota': 'ND',\n",
    "    'Northern Mariana Islands':'MP',\n",
    "    'Ohio': 'OH',\n",
    "    'Oklahoma': 'OK',\n",
    "    'Oregon': 'OR',\n",
    "    'Pennsylvania': 'PA',\n",
    "    'Puerto Rico': 'PR',\n",
    "    'Rhode Island': 'RI',\n",
    "    'South Carolina': 'SC',\n",
    "    'South Dakota': 'SD',\n",
    "    'Tennessee': 'TN',\n",
    "    'Texas': 'TX',\n",
    "    'Utah': 'UT',\n",
    "    'Vermont': 'VT',\n",
    "    'Virgin Islands': 'VI',\n",
    "    'Virginia': 'VA',\n",
    "    'Washington': 'WA',\n",
    "    'West Virginia': 'WV',\n",
    "    'Wisconsin': 'WI',\n",
    "    'Wyoming': 'WY'\n",
    "}"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "and we can use the dictionary to create a table (DataFrame) from it:\n",
    "* Create a new dataframe table\n",
    "* Using the dictionary of the us state abbreviation above\n",
    "* with row for each entry\n",
    "* and name the column of the value 'Abbreviation' "
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 16,
   "metadata": {},
   "outputs": [],
   "source": [
    "us_state_abbrev_df = (\n",
    "    pd\n",
    "    .DataFrame\n",
    "    .from_dict(us_state_abbrev, \n",
    "                orient='index',\n",
    "                columns=['Abbreviation'])\n",
    ")"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "Before we continue, we can keep this table as file to be used in future analyses. We could also create this file in Excel and load it as we will do in the future with this file. "
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 17,
   "metadata": {},
   "outputs": [],
   "source": [
    "us_state_abbrev_df.to_csv('../data/us_state_abbrev.csv')"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "Next, we will:\n",
    "* Start with the abbreviation list of all the US states\n",
    "* Take only the ones that also appear in the list of red states in the 2020 election\n",
    "* Take all the rows left, and the index column (country name) with the abbreviation column"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 18,
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
       "Alabama           AL\n",
       "Alaska            AK\n",
       "Arkansas          AR\n",
       "Florida           FL\n",
       "Idaho             ID\n",
       "Indiana           IN\n",
       "Iowa              IA\n",
       "Kansas            KS\n",
       "Kentucky          KY\n",
       "Louisiana         LA\n",
       "Mississippi       MS\n",
       "Missouri          MO\n",
       "Montana           MT\n",
       "North Carolina    NC\n",
       "North Dakota      ND\n",
       "Ohio              OH\n",
       "Oklahoma          OK\n",
       "South Carolina    SC\n",
       "South Dakota      SD\n",
       "Tennessee         TN\n",
       "Texas             TX\n",
       "Utah              UT\n",
       "West Virginia     WV\n",
       "Wyoming           WY\n",
       "Name: Abbreviation, dtype: object"
      ]
     },
     "execution_count": 18,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "red_state_abbrev = (\n",
    "    us_state_abbrev_df\n",
    "    .loc[red_states_2020]\n",
    "    .loc[:,'Abbreviation']\n",
    ")\n",
    "red_state_abbrev"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "Finally:\n",
    "* Start with the table of the median value of sqr feet per zip code\n",
    "* Add a new column (_is_red_) that checks if the _State_ in each row is in the list of red states that we calculated above. \n",
    "* Create two groups for red and blue states and the a value in the last period in the table (September 2017)\n",
    "* Calculate the average (_mean_) price for each of the two groups\n",
    "* Plot the two values\n",
    "* Title the graph with \"Average square feet value between blue and red states\""
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 19,
   "metadata": {
    "tags": []
   },
   "outputs": [
    {
     "data": {
      "image/png": "iVBORw0KGgoAAAANSUhEUgAAAXcAAAEmCAYAAACZEtCsAAAAOXRFWHRTb2Z0d2FyZQBNYXRwbG90bGliIHZlcnNpb24zLjMuMiwgaHR0cHM6Ly9tYXRwbG90bGliLm9yZy8vihELAAAACXBIWXMAAAsTAAALEwEAmpwYAAAdcklEQVR4nO3de5wcZZ3v8c83CYT7NQNCgATZyBG8RJ0DevDCCq6AHFF35aIIETTwEo7XdQXXVXBhxQug+9IFgyAgd0GEXVkFEciiBBkwIlclGEhiSIYAGm6ahN/543naVJrumZ7p6enkme/79ZrXVD1VXfWr6upvVz99KUUEZmZWlnHdLsDMzEaew93MrEAOdzOzAjnczcwK5HA3MyuQw93MrEAOd3sRSadIelzSY92upRlJUyWFpAkdWPZ8SfuO9HLXFnm//U2TaTMk3TraNQ2VpJslfahDy+7YsTWa1rpwz3fak5ImdruWsUjSTsCngN0i4iVtLmudCIqRVPoTgzUn6SRJFw1h/r0lLexUPWtVuEuaCrwJCOCdHVj+Ov1MPBzD2OadgGURsbQT9ZiNtrH4uAcgItaaP+DzwM+BM4D/ym0TgaeAV1Tm6wGeA7bJ4wcCc/N8vwBeVZl3PvAZ4G7gz8AE4ARgHrAcuA94d2X+8cDpwOPA74HjSU82E/L0zYFzgcXAIuAUYHyT7dkD6AP+BCwBzqhM+wDwCLAM+Odc57552vnAKZV59wYWVsYHqn9G3odn5mWfkvfh14BHcx1nAxs2qHffvF9fAJ4Gzs/tr8/79Sng18Delds03B/Ay4HngVV5WU81WN8hQF9d2yeAa/PwO4Bf5f23ADipMt/Uuvvlr/svj58EXFQZb7oNDeqaD5yY9+2TwHeBDSrTGx5vwPfyvnsub/M/ARcAn8rTJ+eaj8vjuwBPAONaOI63B64C+knH5UfrtvUK4MJ8TNwL9A6wfQF8FHiYdJx/tVLDDODWRvs4t90MfKgyfhRwf95PPwGmDLDe7wOPAX8EZgO7V6adD3wL+FHehtuBXSrT3wY8kG/7TeCWah116zkJuBK4iHTsfIgBHrek4/VreV88DBxXv911y/9MXsZy4EFgH2A/4C/Ainzf/zrP+8G8f5bnZR+T2zdmzcfa0/k+Hsfqx/eyfL9ulW+zQd6mZfkYuQPYtun+7nRgD+UPeAj4CPC6vJO2ze3nAadW5jsO+HEefg2wFNgz30lHkh6cEysP1LnAjuRAA95b2ZGHAM8A2+Vpx5Ie1DsAWwI/Zc0QuRr4dr5ztgF+WbvDGmzPbcAH8vAmwOvz8G75znwzKXjPAFbSergPVP+MvKz/R3oi25AU9NcCWwGbAv8JfKlJzfXrmpwPpgPy+t6Wx3sG2x9UgqLJujYiHfTTKm13AIdWanllXu+rSE9M72oUPAwQ7oNtQ4O65gP3kI6ZrUhPlqcM4Xir1nEU8J95+H2kB+3llWnXDLbcXPOdpJOf9YGXkoLi7ZVtfT5v33jgS8CcAfZ7ADflbdsJ+C05KBlCuAMHkR6zLycda58DfjHAeo8iHX8Tga8DcyvTzs/3yR55WRcDl+Vpk0jHyT8A65FOAFYycLivAN6V992GDHycHkt64qjd3zfVb3dl2buSTjS2r+yjXeqPucr87yA9iQt4C/As8NpGj7Xc9jFgDil/JuaaL83TjiE9djfK9/PrgM2a7u/RCO1W/oA35jtkUh5/APhEHt4XmFeZ9+fAEXn4LOBf65b1IPCWyoPtqEHWPRc4KA//jEpY53VHPuC2JZ39b1iZfhhwU5PlzgZOrm1Tpf3ztQM3j29MetZvKdwHqX8G8GhlmkjhXz0LegPw+ybLWmNdpLOU79XN8xNS+Ay4Pxgk3PM8FwGfz8PTSA/ijZrM+3XgzMqDqtVwb7oNTdYzHzi2Mn5A7fhr8Xir1rEL6ax2HOkV0zG1/Us6q//kYMslBf6jddNOBL5b2dafVqbtBjw3wD4PYL/K+EeAG+vvs/p9nNtuZnW4/zdwdGXaOFJ4TRnoPs/zbpGXvXnlmP9O3T5/IA8fQeXJinRML2TgcJ9dGR/sOP1Z3f39d/XbXZn2N6Qn4X2B9Rqs96JGNVXm+SHwsUaPtdx2P7BPZXw7Ui5OID05rvGKbqC/tanP/Ujg+oh4PI9fktsgPZNuJGnP3C8/nfRMDDAF+JSkp2p/pGfg7SvLXlBdkaQjJM2tzP8K0tkB+XYLmtx2CunMYXHltt8mnQk0cjTwMuABSXdIOrDROiLiGdJZS0sGqb++5h7SM/2dlfl/nNtbMQV4b93+fSPpoBvq/mjkEtIDDdKZ7Q8j4tm8nXtKuklSv6Q/ks6wJjVZznC3oZnqPnyE1cdTK8fbX0XEPNKT63TS+0n/BfxB0q6k4L6lheVOAbavm/ZZUmjVVD/Z9CywwSB9zc22byimAN+o1PQEKXgn188oabyk0yTNk/Qn0pMgrHl/1m/DJnm4/vESdfU3MpTHbf1j/pFmC42Ih4CPk4J8qaTLJDXdd5L2lzRH0hN5vQcw8DE8Bbi6Uuf9pK7NbUndfj8BLpP0B0lfkbReswWtFW80SNoQOBgYX/n43URgC0mvjohfS7qCFAJLSP3xy/N8C0hdNqcOsIqorGsKcA6pn+y2iFglaS7poITUJ7dD5bY7VoYXkM4AJkXEysG2KyJ+BxwmaRzwHuBKSVvndby8UtNGwNaVmz5DCuSal1TmHaz+NbaX1I/4HKl/c9FgNTewgHTW++H6CZK2Y+D9EQ3a6t0A9EiaTrp/P1GZdgmpf3X/iHhe0tdp/sBous8G2oYBVO/3nYA/VJY10PHWaJtvIXUprB8RiyTdQjpx2ZL0qmvA5UqqvdKaNoT6B7MjqW8e1ty+qmfy/41Ifdfw4v16akRc3ML63kfqxtmXFOybk17RaIDb1Cymcn9IEmveP41U74fBHrdrLJ+0P5ovOOIS4BJJm5GeJL5Meg9tjfs+f+LvKtIrj2siYoWkH7J6mxsdKwtIPQ0/b7L6k4GT80nudaRXd+c2mnFtOXN/F+nZaTfSGc50Uvj9D2nHQHqgHwK8Pw/XnAMcm8/yJGljSe+QtGmTdW1M2qn9AJI+SDrzrbkC+JikyZK2IL2kByAiFgPXA6dL2kzSOEm7SHpLoxVJOlxST0S8QHoDBNIbKFcCB0p6o6T1gS+y5n0xFzhA0laSXkI6U2i1/jXkdZ8DnClpm3ybyZLe3uw2dS4C/q+kt+ezrw3yR7h2aGF/LAF2yNvYrL4VpDfavkrq77yhMnlT4Ikc7HuQAqKZucChktaT1EsK00G3YYDlHSdpB0lbkd7wvjy3D3a8LSH1iVfdQnpjfnYevzmP3xoRq1pY7i+B5ZI+I2nDvA2vkPS/B6h/MJ+WtKWkHUn9vJfXzxAR/aQ3Dg/P6zyK1M1UczZwoqTdASRtLum9Tda3KSlgl5GeLP5tCLX+CNhd0nvyq5GPsuaTzIBaOE6vAD6a7+8tSW9oNiRpV0lvzcH9PKvfFIV030/NJ3OQ3h+ZSHqsrpS0P6nLh8r8W0vavNJ2NnBqPolDUo+kg/Lw30p6paTxpCfbFZV1v8jaEu5HkvoPH42Ix2p/pLO290uaEBG3k84ktif19QEQEX3Ah/O8T5Le4JnRbEURcR/p0zC3kXbuK0l9+DXnkA6Eu0mf1LiO9OZN7UF4BOlOq32S4kqav7zfD7hX0tPAN0hvFD4XEfeS3hS+hHTW8CSpD7Hme6RPdMzPtfz1gddC/Y18hrRf5ii9JP4p6Y2hQUXEAtIZ12dJB+kC4NOsPnYG2h8/I50dPibpcZq7hHRG9/26M6uPAF+UtJz0PsUVAyzjX1jdv30ylROAFrahWU3Xk964nEf6dEUrx9uXgM8pvaz+x9x2CyncauF+KyngauMDLjc/ARxIOun5PenV2HdIZ7/DdQ3pTdq5pPBsePaXa/o0KZR3J/X51mq+mnTWelk+ru4B9m+ynAtJ3R2LSMfKnFYLzV217wVOy3VMY/Bjvt5Ax+k5pO6OXwN3AT8YYDkTcx2Pk7qRtiG9/wHpJAVgmaS7cu/CR0nH7ZOkk5NrK9v1AHAp8HA+XrYn5cS1wPX5uJ9Des8F0hPalaRgv590XH2vWaHKnfbWRH62PTsipnR4PfNJbxD9tJPrMbOxYW05c19r5Je9B0iaIGky8AVWv3lrZrZOcLi/mEgv658kdcvcT+oSMDNbZ7hbxsysQD5zNzMr0FrxOfdJkybF1KlTu12Gmdk65c4773w8Ihp+IXGtCPepU6fS19fX7TLMzNYpkpp+m9bdMmZmBXK4m5kVyOFuZlagQcNd0o5Kv8x3n6R7JX0st28l6QZJv8v/t8ztkvTvkh6SdLek13Z6I8zMbE2tnLmvJF1JZjfS1WyOk7Qb6cd1bsy/VHcjq39sZ3/Sbz9MA2aSfqfazMxG0aDhHhGLI+KuPLyc9I3NyaQfYrogz3YB6Zcdye0XRjKH9LO9A/1utpmZjbAh9bkr/Ybwa0jXN9w2/5QmpF9Hq104YDJr/vD9Qhr/eP9MSX2S+vr7+4dat5mZDaDlcJe0CemH5z8eEX+qTstXRhnS7xhExKyI6I2I3p6eVi8KZGZmrWgp3JUu5XQVcHFE1H7reEmtuyX/X5rbF7HmVU12yG1mZjZKBv2GqiSRfsj//og4ozLpWtJFNk7L/6+ptB8v6TLSj8z/sdJ9s06besKPul1CUeaf9o5ul2BWrFZ+fmAv0vUBf6N0rU5IV7Q5DbhC0tGkK6wcnKddR7oI7EOki9x+cCQLNjOzwQ0a7hFxK80vYrtPg/mDdAk5MzPrEn9D1cysQA53M7MCOdzNzArkcDczK5DD3cysQA53M7MCOdzNzArkcDczK5DD3cysQA53M7MCOdzNzArkcDczK5DD3cysQA53M7MCOdzNzArkcDczK5DD3cysQA53M7MCDRruks6TtFTSPZW2yyXNzX/za9dWlTRV0nOVaWd3sHYzM2uilQtknw98E7iw1hARh9SGJZ0O/LEy/7yImD5C9ZmZ2TC0coHs2ZKmNpomScDBwFtHuC4zM2tDu33ubwKWRMTvKm07S/qVpFskvanZDSXNlNQnqa+/v7/NMszMrKrdcD8MuLQyvhjYKSJeA3wSuETSZo1uGBGzIqI3Inp7enraLMPMzKqGHe6SJgDvAS6vtUXEnyNiWR6+E5gHvKzdIs3MbGjaOXPfF3ggIhbWGiT1SBqfh18KTAMebq9EMzMbqlY+CnkpcBuwq6SFko7Okw5lzS4ZgDcDd+ePRl4JHBsRT4xgvWZm1oJWPi1zWJP2GQ3argKuar8sMzNrh7+hamZWIIe7mVmBHO5mZgVyuJuZFcjhbmZWIIe7mVmBHO5mZgVyuJuZFcjhbmZWIIe7mVmBHO5mZgVyuJuZFcjhbmZWIIe7mVmBHO5mZgVyuJuZFcjhbmZWoFYus3eepKWS7qm0nSRpkaS5+e+AyrQTJT0k6UFJb+9U4WZm1lwrZ+7nA/s1aD8zIqbnv+sAJO1Gurbq7vk2/1G7YLaZmY2eQcM9ImYDrV7k+iDgsoj4c0T8HngI2KON+szMbBja6XM/XtLdudtmy9w2GVhQmWdhbnsRSTMl9Unq6+/vb6MMMzOrN9xwPwvYBZgOLAZOH+oCImJWRPRGRG9PT88wyzAzs0aGFe4RsSQiVkXEC8A5rO56WQTsWJl1h9xmZmajaFjhLmm7yui7gdonaa4FDpU0UdLOwDTgl+2VaGZmQzVhsBkkXQrsDUyStBD4ArC3pOlAAPOBYwAi4l5JVwD3ASuB4yJiVUcqNzOzpgYN94g4rEHzuQPMfypwajtFmZlZe/wNVTOzAjnczcwK5HA3MyuQw93MrEAOdzOzAjnczcwK5HA3MyuQw93MrEAOdzOzAjnczcwK5HA3MyuQw93MrEAOdzOzAjnczcwK5HA3MyuQw93MrEAOdzOzAg0a7pLOk7RU0j2Vtq9KekDS3ZKulrRFbp8q6TlJc/Pf2R2s3czMmmjlzP18YL+6thuAV0TEq4DfAidWps2LiOn579iRKdPMzIZi0HCPiNnAE3Vt10fEyjw6B9ihA7WZmdkwjUSf+1HAf1fGd5b0K0m3SHpTsxtJmimpT1Jff3//CJRhZmY1bYW7pH8GVgIX56bFwE4R8Rrgk8AlkjZrdNuImBURvRHR29PT004ZZmZWZ9jhLmkGcCDw/ogIgIj4c0Qsy8N3AvOAl41AnWZmNgTDCndJ+wH/BLwzIp6ttPdIGp+HXwpMAx4eiULNzKx1EwabQdKlwN7AJEkLgS+QPh0zEbhBEsCc/MmYNwNflLQCeAE4NiKeaLhgMzPrmEHDPSIOa9B8bpN5rwKuarcoMzNrj7+hamZWIIe7mVmBHO5mZgVyuJuZFcjhbmZWIIe7mVmBHO5mZgVyuJuZFcjhbmZWIIe7mVmBHO5mZgVyuJuZFcjhbmZWIIe7mVmBHO5mZgVyuJuZFcjhbmZWoEGvxAQg6TzSxbCXRsQrcttWwOXAVGA+cHBEPKl03b1vAAcAzwIzIuKukS/dzKqmnvCjbpdQjPmnvaPbJbSt1TP384H96tpOAG6MiGnAjXkcYH/ShbGnATOBs9ov08zMhqKlcI+I2UD9ha4PAi7IwxcA76q0XxjJHGALSduNQK1mZtaidvrct42IxXn4MWDbPDwZWFCZb2FuW4OkmZL6JPX19/e3UYaZmdUbkTdUIyKAGOJtZkVEb0T09vT0jEQZZmaWtRPuS2rdLfn/0ty+CNixMt8Ouc3MzEZJO+F+LXBkHj4SuKbSfoSS1wN/rHTfmJnZKGj1o5CXAnsDkyQtBL4AnAZcIelo4BHg4Dz7daSPQT5E+ijkB0e4ZjMzG0RL4R4RhzWZtE+DeQM4rp2izMysPf6GqplZgRzuZmYFcribmRXI4W5mViCHu5lZgRzuZmYFcribmRXI4W5mViCHu5lZgRzuZmYFcribmRXI4W5mViCHu5lZgRzuZmYFcribmRXI4W5mViCHu5lZgVq6ElMjknYFLq80vRT4PLAF8GGgP7d/NiKuG+56zMxs6IYd7hHxIDAdQNJ4YBFwNemaqWdGxNdGokAzMxu6keqW2QeYFxGPjNDyzMysDSMV7ocCl1bGj5d0t6TzJG3Z6AaSZkrqk9TX39/faBYzMxumtsNd0vrAO4Hv56azgF1IXTaLgdMb3S4iZkVEb0T09vT0tFuGmZlVjMSZ+/7AXRGxBCAilkTEqoh4ATgH2GME1mFmZkMwEuF+GJUuGUnbVaa9G7hnBNZhZmZDMOxPywBI2hh4G3BMpfkrkqYDAcyvm2ZmZqOgrXCPiGeArevaPtBWRWZm1jZ/Q9XMrEAOdzOzAjnczcwK5HA3MyuQw93MrEAOdzOzAjnczcwK5HA3MyuQw93MrEAOdzOzAjnczcwK5HA3MyuQw93MrEAOdzOzAjnczcwK5HA3MyuQw93MrEBtXYkJQNJ8YDmwClgZEb2StgIuB6aSLrV3cEQ82e66zMysNSN15v63ETE9Inrz+AnAjRExDbgxj5uZ2SjpVLfMQcAFefgC4F0dWo+ZmTUwEuEewPWS7pQ0M7dtGxGL8/BjwLb1N5I0U1KfpL7+/v4RKMPMzGra7nMH3hgRiyRtA9wg6YHqxIgISVF/o4iYBcwC6O3tfdF0MzMbvrbP3CNiUf6/FLga2ANYImk7gPx/abvrMTOz1rUV7pI2lrRpbRj4O+Ae4FrgyDzbkcA17azHzMyGpt1umW2BqyXVlnVJRPxY0h3AFZKOBh4BDm5zPWZmNgRthXtEPAy8ukH7MmCfdpZtZmbD52+ompkVyOFuZlYgh7uZWYEc7mZmBXK4m5kVyOFuZlYgh7uZWYEc7mZmBXK4m5kVyOFuZlYgh7uZWYEc7mZmBXK4m5kVyOFuZlYgh7uZWYEc7mZmBXK4m5kVaNjhLmlHSTdJuk/SvZI+lttPkrRI0tz8d8DIlWtmZq1o5zJ7K4FPRcRd+SLZd0q6IU87MyK+1n55ZmY2HMMO94hYDCzOw8sl3Q9MHqnCzMxs+Eakz13SVOA1wO256XhJd0s6T9KWTW4zU1KfpL7+/v6RKMPMzLK2w13SJsBVwMcj4k/AWcAuwHTSmf3pjW4XEbMiojcient6etotw8zMKtoKd0nrkYL94oj4AUBELImIVRHxAnAOsEf7ZZqZ2VC082kZAecC90fEGZX27SqzvRu4Z/jlmZnZcLTzaZm9gA8Av5E0N7d9FjhM0nQggPnAMW2sw8zMhqGdT8vcCqjBpOuGX46ZmY0Ef0PVzKxADnczswI53M3MCuRwNzMrkMPdzKxADnczswI53M3MCuRwNzMrkMPdzKxADnczswI53M3MCuRwNzMrkMPdzKxADnczswI53M3MCuRwNzMrkMPdzKxADnczswJ1LNwl7SfpQUkPSTqhU+sxM7MX60i4SxoPfAvYH9iNdNHs3TqxLjMze7FOnbnvATwUEQ9HxF+Ay4CDOrQuMzOrM6FDy50MLKiMLwT2rM4gaSYwM48+LenBDtUyFk0CHu92EYPRl7tdgXWBj82RNaXZhE6F+6AiYhYwq1vrL5mkvojo7XYdZvV8bI6eTnXLLAJ2rIzvkNvMzGwUdCrc7wCmSdpZ0vrAocC1HVqXmZnV6Ui3TESslHQ88BNgPHBeRNzbiXVZQ+7usrWVj81Roojodg1mZjbC/A1VM7MCOdzNzArkcDczK5DD3cw6Rsnhkj6fx3eStEe36xoLHO6FkLSRpH+RdE4enybpwG7XZWPefwBvAA7L48tJvztlHeZwL8d3gT+THkiQvjR2SvfKMQNgz4g4DngeICKeBNbvbkljg8O9HLtExFeAFQAR8Syg7pZkxor8K7EBIKkHeKG7JY0NDvdy/EXShqx+EO1COpM366Z/B64GtpF0KnAr8G/dLWls8JeYCiHpbcDnSL+ffz2wFzAjIm7uZl1mkv4XsA/pleSNEXF/l0saExzuBZG0NfB60oNoTkSs9T+tamWTtFOj9oh4dLRrGWsc7oWQtBcwNyKekXQ48FrgGxHxSJdLszFM0m9IXYUCNgB2Bh6MiN27WtgY4D73cpwFPCvp1cAngXnAhd0tyca6iHhlRLwq/59Gukrbbd2uayxwuJdjZaSXYQcB34qIbwGbdrkmszVExF3UXZXNOqNrV2KyEbdc0onA4cCbJY0D1utyTTbGSfpkZXQcqbvwD10qZ0zxmXs5DiF99PHoiHiMdPWrr3a3JDM2rfxNBH5EenVpHeY3VM2sI/KXl74cEf/Y7VrGInfLrOMkLSd/cal+EhARsdkol2SGpAn5imx7dbuWscpn7mY24iTdFRGvlXQWMBn4PvBMbXpE/KBrxY0RPnMvjKRtSJ8nBvxlEeu6DYBlwFtZ/Xn3ABzuHeZwL4SkdwKnA9sDS4EpwP2Avyxi3bBN/qTMPawO9Rp3F4wCf1qmHP9K+umB30bEzqTf8pjT3ZJsDBsPbJL/Nq0M1/6sw3zmXo4VEbFM0jhJ4yLiJklf73ZRNmYtjogvdruIsczhXo6nJG0CzAYulrSUyhtYZqPM1xLoMn9aZh0naaeIeFTSxsBzpK629wObAxdHxLKuFmhjkqStIuKJbtcxljnc13G1j5zl4asi4u+7XZOZdZ/fUF33VV/+vrRrVZjZWsXhvu6LJsNmNoa5W2YdJ2kV6Y1TARsCz9Ym4Z8fMBuzHO5mZgVyt4yZWYEc7mZmBXK4m5kVyOFuY4akX4zSemZI+uZorMusGYe7jRkR8X/auX2+spDZOsHhbmOGpKfz/+0kzZY0V9I9kt400G0knS7p18AbJB0u6Zf5tt+uBb6kD0r6raRfAr76kHWdw93GovcBP4mI6cCrgbkDzLsxcHtEvJp00YlDgL3ybVcB75e0HXAyKdTfCOzWscrNWuRfhbSx6A7gPEnrAT+MiLkDzLsKuCoP7wO8DrhDEqQvjS0F9gRujoh+AEmXAy/rTOlmrfGZu405ETEbeDOwCDhf0hEDzP58RKzKwwIuiIjp+W/XiDipw+WaDYvD3cYcSVOAJRFxDvAd4LUt3vRG4B/ydWqRtFVe1u3AWyRtnV8NvLcTdZsNhbtlbCzaG/i0pBXA08BAZ+5/FRH3SfoccL2kccAK4LiImCPpJOA24CkG7sM3GxX+bRkzswK5W8bMrEDuljEDJN0OTKxr/kBE/KYb9Zi1y90yZmYFcreMmVmBHO5mZgVyuJuZFcjhbmZWoP8PPpv3SlymhpsAAAAASUVORK5CYII=",
      "image/svg+xml": "<?xml version=\"1.0\" encoding=\"utf-8\" standalone=\"no\"?>\n<!DOCTYPE svg PUBLIC \"-//W3C//DTD SVG 1.1//EN\"\n  \"http://www.w3.org/Graphics/SVG/1.1/DTD/svg11.dtd\">\n<!-- Created with matplotlib (https://matplotlib.org/) -->\n<svg height=\"293.022188pt\" version=\"1.1\" viewBox=\"0 0 375.2875 293.022188\" width=\"375.2875pt\" xmlns=\"http://www.w3.org/2000/svg\" xmlns:xlink=\"http://www.w3.org/1999/xlink\">\n <metadata>\n  <rdf:RDF xmlns:cc=\"http://creativecommons.org/ns#\" xmlns:dc=\"http://purl.org/dc/elements/1.1/\" xmlns:rdf=\"http://www.w3.org/1999/02/22-rdf-syntax-ns#\">\n   <cc:Work>\n    <dc:type rdf:resource=\"http://purl.org/dc/dcmitype/StillImage\"/>\n    <dc:date>2021-01-10T10:42:45.830371</dc:date>\n    <dc:format>image/svg+xml</dc:format>\n    <dc:creator>\n     <cc:Agent>\n      <dc:title>Matplotlib v3.3.2, https://matplotlib.org/</dc:title>\n     </cc:Agent>\n    </dc:creator>\n   </cc:Work>\n  </rdf:RDF>\n </metadata>\n <defs>\n  <style type=\"text/css\">*{stroke-linecap:butt;stroke-linejoin:round;}</style>\n </defs>\n <g id=\"figure_1\">\n  <g id=\"patch_1\">\n   <path d=\"M 0 293.022188 \nL 375.2875 293.022188 \nL 375.2875 0 \nL 0 0 \nz\n\" style=\"fill:none;\"/>\n  </g>\n  <g id=\"axes_1\">\n   <g id=\"patch_2\">\n    <path d=\"M 33.2875 239.758125 \nL 368.0875 239.758125 \nL 368.0875 22.318125 \nL 33.2875 22.318125 \nz\n\" style=\"fill:#ffffff;\"/>\n   </g>\n   <g id=\"patch_3\">\n    <path clip-path=\"url(#p0d48ab611a)\" d=\"M 75.1375 239.758125 \nL 158.8375 239.758125 \nL 158.8375 32.672411 \nL 75.1375 32.672411 \nz\n\" style=\"fill:#1f77b4;\"/>\n   </g>\n   <g id=\"patch_4\">\n    <path clip-path=\"url(#p0d48ab611a)\" d=\"M 242.5375 239.758125 \nL 326.2375 239.758125 \nL 326.2375 131.694841 \nL 242.5375 131.694841 \nz\n\" style=\"fill:#1f77b4;\"/>\n   </g>\n   <g id=\"matplotlib.axis_1\">\n    <g id=\"xtick_1\">\n     <g id=\"line2d_1\">\n      <defs>\n       <path d=\"M 0 0 \nL 0 3.5 \n\" id=\"mee2d3bfb9e\" style=\"stroke:#000000;stroke-width:0.8;\"/>\n      </defs>\n      <g>\n       <use style=\"stroke:#000000;stroke-width:0.8;\" x=\"116.9875\" xlink:href=\"#mee2d3bfb9e\" y=\"239.758125\"/>\n      </g>\n     </g>\n     <g id=\"text_1\">\n      <!-- False -->\n      <g transform=\"translate(119.746875 271.865938)rotate(-90)scale(0.1 -0.1)\">\n       <defs>\n        <path d=\"M 9.8125 72.90625 \nL 51.703125 72.90625 \nL 51.703125 64.59375 \nL 19.671875 64.59375 \nL 19.671875 43.109375 \nL 48.578125 43.109375 \nL 48.578125 34.8125 \nL 19.671875 34.8125 \nL 19.671875 0 \nL 9.8125 0 \nz\n\" id=\"DejaVuSans-70\"/>\n        <path d=\"M 34.28125 27.484375 \nQ 23.390625 27.484375 19.1875 25 \nQ 14.984375 22.515625 14.984375 16.5 \nQ 14.984375 11.71875 18.140625 8.90625 \nQ 21.296875 6.109375 26.703125 6.109375 \nQ 34.1875 6.109375 38.703125 11.40625 \nQ 43.21875 16.703125 43.21875 25.484375 \nL 43.21875 27.484375 \nz\nM 52.203125 31.203125 \nL 52.203125 0 \nL 43.21875 0 \nL 43.21875 8.296875 \nQ 40.140625 3.328125 35.546875 0.953125 \nQ 30.953125 -1.421875 24.3125 -1.421875 \nQ 15.921875 -1.421875 10.953125 3.296875 \nQ 6 8.015625 6 15.921875 \nQ 6 25.140625 12.171875 29.828125 \nQ 18.359375 34.515625 30.609375 34.515625 \nL 43.21875 34.515625 \nL 43.21875 35.40625 \nQ 43.21875 41.609375 39.140625 45 \nQ 35.0625 48.390625 27.6875 48.390625 \nQ 23 48.390625 18.546875 47.265625 \nQ 14.109375 46.140625 10.015625 43.890625 \nL 10.015625 52.203125 \nQ 14.9375 54.109375 19.578125 55.046875 \nQ 24.21875 56 28.609375 56 \nQ 40.484375 56 46.34375 49.84375 \nQ 52.203125 43.703125 52.203125 31.203125 \nz\n\" id=\"DejaVuSans-97\"/>\n        <path d=\"M 9.421875 75.984375 \nL 18.40625 75.984375 \nL 18.40625 0 \nL 9.421875 0 \nz\n\" id=\"DejaVuSans-108\"/>\n        <path d=\"M 44.28125 53.078125 \nL 44.28125 44.578125 \nQ 40.484375 46.53125 36.375 47.5 \nQ 32.28125 48.484375 27.875 48.484375 \nQ 21.1875 48.484375 17.84375 46.4375 \nQ 14.5 44.390625 14.5 40.28125 \nQ 14.5 37.15625 16.890625 35.375 \nQ 19.28125 33.59375 26.515625 31.984375 \nL 29.59375 31.296875 \nQ 39.15625 29.25 43.1875 25.515625 \nQ 47.21875 21.78125 47.21875 15.09375 \nQ 47.21875 7.46875 41.1875 3.015625 \nQ 35.15625 -1.421875 24.609375 -1.421875 \nQ 20.21875 -1.421875 15.453125 -0.5625 \nQ 10.6875 0.296875 5.421875 2 \nL 5.421875 11.28125 \nQ 10.40625 8.6875 15.234375 7.390625 \nQ 20.0625 6.109375 24.8125 6.109375 \nQ 31.15625 6.109375 34.5625 8.28125 \nQ 37.984375 10.453125 37.984375 14.40625 \nQ 37.984375 18.0625 35.515625 20.015625 \nQ 33.0625 21.96875 24.703125 23.78125 \nL 21.578125 24.515625 \nQ 13.234375 26.265625 9.515625 29.90625 \nQ 5.8125 33.546875 5.8125 39.890625 \nQ 5.8125 47.609375 11.28125 51.796875 \nQ 16.75 56 26.8125 56 \nQ 31.78125 56 36.171875 55.265625 \nQ 40.578125 54.546875 44.28125 53.078125 \nz\n\" id=\"DejaVuSans-115\"/>\n        <path d=\"M 56.203125 29.59375 \nL 56.203125 25.203125 \nL 14.890625 25.203125 \nQ 15.484375 15.921875 20.484375 11.0625 \nQ 25.484375 6.203125 34.421875 6.203125 \nQ 39.59375 6.203125 44.453125 7.46875 \nQ 49.3125 8.734375 54.109375 11.28125 \nL 54.109375 2.78125 \nQ 49.265625 0.734375 44.1875 -0.34375 \nQ 39.109375 -1.421875 33.890625 -1.421875 \nQ 20.796875 -1.421875 13.15625 6.1875 \nQ 5.515625 13.8125 5.515625 26.8125 \nQ 5.515625 40.234375 12.765625 48.109375 \nQ 20.015625 56 32.328125 56 \nQ 43.359375 56 49.78125 48.890625 \nQ 56.203125 41.796875 56.203125 29.59375 \nz\nM 47.21875 32.234375 \nQ 47.125 39.59375 43.09375 43.984375 \nQ 39.0625 48.390625 32.421875 48.390625 \nQ 24.90625 48.390625 20.390625 44.140625 \nQ 15.875 39.890625 15.1875 32.171875 \nz\n\" id=\"DejaVuSans-101\"/>\n       </defs>\n       <use xlink:href=\"#DejaVuSans-70\"/>\n       <use x=\"48.394531\" xlink:href=\"#DejaVuSans-97\"/>\n       <use x=\"109.673828\" xlink:href=\"#DejaVuSans-108\"/>\n       <use x=\"137.457031\" xlink:href=\"#DejaVuSans-115\"/>\n       <use x=\"189.556641\" xlink:href=\"#DejaVuSans-101\"/>\n      </g>\n     </g>\n    </g>\n    <g id=\"xtick_2\">\n     <g id=\"line2d_2\">\n      <g>\n       <use style=\"stroke:#000000;stroke-width:0.8;\" x=\"284.3875\" xlink:href=\"#mee2d3bfb9e\" y=\"239.758125\"/>\n      </g>\n     </g>\n     <g id=\"text_2\">\n      <!-- True -->\n      <g transform=\"translate(287.146875 267.9925)rotate(-90)scale(0.1 -0.1)\">\n       <defs>\n        <path d=\"M -0.296875 72.90625 \nL 61.375 72.90625 \nL 61.375 64.59375 \nL 35.5 64.59375 \nL 35.5 0 \nL 25.59375 0 \nL 25.59375 64.59375 \nL -0.296875 64.59375 \nz\n\" id=\"DejaVuSans-84\"/>\n        <path d=\"M 41.109375 46.296875 \nQ 39.59375 47.171875 37.8125 47.578125 \nQ 36.03125 48 33.890625 48 \nQ 26.265625 48 22.1875 43.046875 \nQ 18.109375 38.09375 18.109375 28.8125 \nL 18.109375 0 \nL 9.078125 0 \nL 9.078125 54.6875 \nL 18.109375 54.6875 \nL 18.109375 46.1875 \nQ 20.953125 51.171875 25.484375 53.578125 \nQ 30.03125 56 36.53125 56 \nQ 37.453125 56 38.578125 55.875 \nQ 39.703125 55.765625 41.0625 55.515625 \nz\n\" id=\"DejaVuSans-114\"/>\n        <path d=\"M 8.5 21.578125 \nL 8.5 54.6875 \nL 17.484375 54.6875 \nL 17.484375 21.921875 \nQ 17.484375 14.15625 20.5 10.265625 \nQ 23.53125 6.390625 29.59375 6.390625 \nQ 36.859375 6.390625 41.078125 11.03125 \nQ 45.3125 15.671875 45.3125 23.6875 \nL 45.3125 54.6875 \nL 54.296875 54.6875 \nL 54.296875 0 \nL 45.3125 0 \nL 45.3125 8.40625 \nQ 42.046875 3.421875 37.71875 1 \nQ 33.40625 -1.421875 27.6875 -1.421875 \nQ 18.265625 -1.421875 13.375 4.4375 \nQ 8.5 10.296875 8.5 21.578125 \nz\nM 31.109375 56 \nz\n\" id=\"DejaVuSans-117\"/>\n       </defs>\n       <use xlink:href=\"#DejaVuSans-84\"/>\n       <use x=\"46.333984\" xlink:href=\"#DejaVuSans-114\"/>\n       <use x=\"87.447266\" xlink:href=\"#DejaVuSans-117\"/>\n       <use x=\"150.826172\" xlink:href=\"#DejaVuSans-101\"/>\n      </g>\n     </g>\n    </g>\n    <g id=\"text_3\">\n     <!-- is_red -->\n     <g transform=\"translate(186 283.464375)scale(0.1 -0.1)\">\n      <defs>\n       <path d=\"M 9.421875 54.6875 \nL 18.40625 54.6875 \nL 18.40625 0 \nL 9.421875 0 \nz\nM 9.421875 75.984375 \nL 18.40625 75.984375 \nL 18.40625 64.59375 \nL 9.421875 64.59375 \nz\n\" id=\"DejaVuSans-105\"/>\n       <path d=\"M 50.984375 -16.609375 \nL 50.984375 -23.578125 \nL -0.984375 -23.578125 \nL -0.984375 -16.609375 \nz\n\" id=\"DejaVuSans-95\"/>\n       <path d=\"M 45.40625 46.390625 \nL 45.40625 75.984375 \nL 54.390625 75.984375 \nL 54.390625 0 \nL 45.40625 0 \nL 45.40625 8.203125 \nQ 42.578125 3.328125 38.25 0.953125 \nQ 33.9375 -1.421875 27.875 -1.421875 \nQ 17.96875 -1.421875 11.734375 6.484375 \nQ 5.515625 14.40625 5.515625 27.296875 \nQ 5.515625 40.1875 11.734375 48.09375 \nQ 17.96875 56 27.875 56 \nQ 33.9375 56 38.25 53.625 \nQ 42.578125 51.265625 45.40625 46.390625 \nz\nM 14.796875 27.296875 \nQ 14.796875 17.390625 18.875 11.75 \nQ 22.953125 6.109375 30.078125 6.109375 \nQ 37.203125 6.109375 41.296875 11.75 \nQ 45.40625 17.390625 45.40625 27.296875 \nQ 45.40625 37.203125 41.296875 42.84375 \nQ 37.203125 48.484375 30.078125 48.484375 \nQ 22.953125 48.484375 18.875 42.84375 \nQ 14.796875 37.203125 14.796875 27.296875 \nz\n\" id=\"DejaVuSans-100\"/>\n      </defs>\n      <use xlink:href=\"#DejaVuSans-105\"/>\n      <use x=\"27.783203\" xlink:href=\"#DejaVuSans-115\"/>\n      <use x=\"79.882812\" xlink:href=\"#DejaVuSans-95\"/>\n      <use x=\"129.882812\" xlink:href=\"#DejaVuSans-114\"/>\n      <use x=\"168.746094\" xlink:href=\"#DejaVuSans-101\"/>\n      <use x=\"230.269531\" xlink:href=\"#DejaVuSans-100\"/>\n     </g>\n    </g>\n   </g>\n   <g id=\"matplotlib.axis_2\">\n    <g id=\"ytick_1\">\n     <g id=\"line2d_3\">\n      <defs>\n       <path d=\"M 0 0 \nL -3.5 0 \n\" id=\"m7691cc4c91\" style=\"stroke:#000000;stroke-width:0.8;\"/>\n      </defs>\n      <g>\n       <use style=\"stroke:#000000;stroke-width:0.8;\" x=\"33.2875\" xlink:href=\"#m7691cc4c91\" y=\"239.758125\"/>\n      </g>\n     </g>\n     <g id=\"text_4\">\n      <!-- 0 -->\n      <g transform=\"translate(19.925 243.557344)scale(0.1 -0.1)\">\n       <defs>\n        <path d=\"M 31.78125 66.40625 \nQ 24.171875 66.40625 20.328125 58.90625 \nQ 16.5 51.421875 16.5 36.375 \nQ 16.5 21.390625 20.328125 13.890625 \nQ 24.171875 6.390625 31.78125 6.390625 \nQ 39.453125 6.390625 43.28125 13.890625 \nQ 47.125 21.390625 47.125 36.375 \nQ 47.125 51.421875 43.28125 58.90625 \nQ 39.453125 66.40625 31.78125 66.40625 \nz\nM 31.78125 74.21875 \nQ 44.046875 74.21875 50.515625 64.515625 \nQ 56.984375 54.828125 56.984375 36.375 \nQ 56.984375 17.96875 50.515625 8.265625 \nQ 44.046875 -1.421875 31.78125 -1.421875 \nQ 19.53125 -1.421875 13.0625 8.265625 \nQ 6.59375 17.96875 6.59375 36.375 \nQ 6.59375 54.828125 13.0625 64.515625 \nQ 19.53125 74.21875 31.78125 74.21875 \nz\n\" id=\"DejaVuSans-48\"/>\n       </defs>\n       <use xlink:href=\"#DejaVuSans-48\"/>\n      </g>\n     </g>\n    </g>\n    <g id=\"ytick_2\">\n     <g id=\"line2d_4\">\n      <g>\n       <use style=\"stroke:#000000;stroke-width:0.8;\" x=\"33.2875\" xlink:href=\"#m7691cc4c91\" y=\"213.605965\"/>\n      </g>\n     </g>\n     <g id=\"text_5\">\n      <!-- 25 -->\n      <g transform=\"translate(13.5625 217.405184)scale(0.1 -0.1)\">\n       <defs>\n        <path d=\"M 19.1875 8.296875 \nL 53.609375 8.296875 \nL 53.609375 0 \nL 7.328125 0 \nL 7.328125 8.296875 \nQ 12.9375 14.109375 22.625 23.890625 \nQ 32.328125 33.6875 34.8125 36.53125 \nQ 39.546875 41.84375 41.421875 45.53125 \nQ 43.3125 49.21875 43.3125 52.78125 \nQ 43.3125 58.59375 39.234375 62.25 \nQ 35.15625 65.921875 28.609375 65.921875 \nQ 23.96875 65.921875 18.8125 64.3125 \nQ 13.671875 62.703125 7.8125 59.421875 \nL 7.8125 69.390625 \nQ 13.765625 71.78125 18.9375 73 \nQ 24.125 74.21875 28.421875 74.21875 \nQ 39.75 74.21875 46.484375 68.546875 \nQ 53.21875 62.890625 53.21875 53.421875 \nQ 53.21875 48.921875 51.53125 44.890625 \nQ 49.859375 40.875 45.40625 35.40625 \nQ 44.1875 33.984375 37.640625 27.21875 \nQ 31.109375 20.453125 19.1875 8.296875 \nz\n\" id=\"DejaVuSans-50\"/>\n        <path d=\"M 10.796875 72.90625 \nL 49.515625 72.90625 \nL 49.515625 64.59375 \nL 19.828125 64.59375 \nL 19.828125 46.734375 \nQ 21.96875 47.46875 24.109375 47.828125 \nQ 26.265625 48.1875 28.421875 48.1875 \nQ 40.625 48.1875 47.75 41.5 \nQ 54.890625 34.8125 54.890625 23.390625 \nQ 54.890625 11.625 47.5625 5.09375 \nQ 40.234375 -1.421875 26.90625 -1.421875 \nQ 22.3125 -1.421875 17.546875 -0.640625 \nQ 12.796875 0.140625 7.71875 1.703125 \nL 7.71875 11.625 \nQ 12.109375 9.234375 16.796875 8.0625 \nQ 21.484375 6.890625 26.703125 6.890625 \nQ 35.15625 6.890625 40.078125 11.328125 \nQ 45.015625 15.765625 45.015625 23.390625 \nQ 45.015625 31 40.078125 35.4375 \nQ 35.15625 39.890625 26.703125 39.890625 \nQ 22.75 39.890625 18.8125 39.015625 \nQ 14.890625 38.140625 10.796875 36.28125 \nz\n\" id=\"DejaVuSans-53\"/>\n       </defs>\n       <use xlink:href=\"#DejaVuSans-50\"/>\n       <use x=\"63.623047\" xlink:href=\"#DejaVuSans-53\"/>\n      </g>\n     </g>\n    </g>\n    <g id=\"ytick_3\">\n     <g id=\"line2d_5\">\n      <g>\n       <use style=\"stroke:#000000;stroke-width:0.8;\" x=\"33.2875\" xlink:href=\"#m7691cc4c91\" y=\"187.453805\"/>\n      </g>\n     </g>\n     <g id=\"text_6\">\n      <!-- 50 -->\n      <g transform=\"translate(13.5625 191.253024)scale(0.1 -0.1)\">\n       <use xlink:href=\"#DejaVuSans-53\"/>\n       <use x=\"63.623047\" xlink:href=\"#DejaVuSans-48\"/>\n      </g>\n     </g>\n    </g>\n    <g id=\"ytick_4\">\n     <g id=\"line2d_6\">\n      <g>\n       <use style=\"stroke:#000000;stroke-width:0.8;\" x=\"33.2875\" xlink:href=\"#m7691cc4c91\" y=\"161.301645\"/>\n      </g>\n     </g>\n     <g id=\"text_7\">\n      <!-- 75 -->\n      <g transform=\"translate(13.5625 165.100864)scale(0.1 -0.1)\">\n       <defs>\n        <path d=\"M 8.203125 72.90625 \nL 55.078125 72.90625 \nL 55.078125 68.703125 \nL 28.609375 0 \nL 18.3125 0 \nL 43.21875 64.59375 \nL 8.203125 64.59375 \nz\n\" id=\"DejaVuSans-55\"/>\n       </defs>\n       <use xlink:href=\"#DejaVuSans-55\"/>\n       <use x=\"63.623047\" xlink:href=\"#DejaVuSans-53\"/>\n      </g>\n     </g>\n    </g>\n    <g id=\"ytick_5\">\n     <g id=\"line2d_7\">\n      <g>\n       <use style=\"stroke:#000000;stroke-width:0.8;\" x=\"33.2875\" xlink:href=\"#m7691cc4c91\" y=\"135.149485\"/>\n      </g>\n     </g>\n     <g id=\"text_8\">\n      <!-- 100 -->\n      <g transform=\"translate(7.2 138.948704)scale(0.1 -0.1)\">\n       <defs>\n        <path d=\"M 12.40625 8.296875 \nL 28.515625 8.296875 \nL 28.515625 63.921875 \nL 10.984375 60.40625 \nL 10.984375 69.390625 \nL 28.421875 72.90625 \nL 38.28125 72.90625 \nL 38.28125 8.296875 \nL 54.390625 8.296875 \nL 54.390625 0 \nL 12.40625 0 \nz\n\" id=\"DejaVuSans-49\"/>\n       </defs>\n       <use xlink:href=\"#DejaVuSans-49\"/>\n       <use x=\"63.623047\" xlink:href=\"#DejaVuSans-48\"/>\n       <use x=\"127.246094\" xlink:href=\"#DejaVuSans-48\"/>\n      </g>\n     </g>\n    </g>\n    <g id=\"ytick_6\">\n     <g id=\"line2d_8\">\n      <g>\n       <use style=\"stroke:#000000;stroke-width:0.8;\" x=\"33.2875\" xlink:href=\"#m7691cc4c91\" y=\"108.997325\"/>\n      </g>\n     </g>\n     <g id=\"text_9\">\n      <!-- 125 -->\n      <g transform=\"translate(7.2 112.796543)scale(0.1 -0.1)\">\n       <use xlink:href=\"#DejaVuSans-49\"/>\n       <use x=\"63.623047\" xlink:href=\"#DejaVuSans-50\"/>\n       <use x=\"127.246094\" xlink:href=\"#DejaVuSans-53\"/>\n      </g>\n     </g>\n    </g>\n    <g id=\"ytick_7\">\n     <g id=\"line2d_9\">\n      <g>\n       <use style=\"stroke:#000000;stroke-width:0.8;\" x=\"33.2875\" xlink:href=\"#m7691cc4c91\" y=\"82.845165\"/>\n      </g>\n     </g>\n     <g id=\"text_10\">\n      <!-- 150 -->\n      <g transform=\"translate(7.2 86.644383)scale(0.1 -0.1)\">\n       <use xlink:href=\"#DejaVuSans-49\"/>\n       <use x=\"63.623047\" xlink:href=\"#DejaVuSans-53\"/>\n       <use x=\"127.246094\" xlink:href=\"#DejaVuSans-48\"/>\n      </g>\n     </g>\n    </g>\n    <g id=\"ytick_8\">\n     <g id=\"line2d_10\">\n      <g>\n       <use style=\"stroke:#000000;stroke-width:0.8;\" x=\"33.2875\" xlink:href=\"#m7691cc4c91\" y=\"56.693005\"/>\n      </g>\n     </g>\n     <g id=\"text_11\">\n      <!-- 175 -->\n      <g transform=\"translate(7.2 60.492223)scale(0.1 -0.1)\">\n       <use xlink:href=\"#DejaVuSans-49\"/>\n       <use x=\"63.623047\" xlink:href=\"#DejaVuSans-55\"/>\n       <use x=\"127.246094\" xlink:href=\"#DejaVuSans-53\"/>\n      </g>\n     </g>\n    </g>\n    <g id=\"ytick_9\">\n     <g id=\"line2d_11\">\n      <g>\n       <use style=\"stroke:#000000;stroke-width:0.8;\" x=\"33.2875\" xlink:href=\"#m7691cc4c91\" y=\"30.540845\"/>\n      </g>\n     </g>\n     <g id=\"text_12\">\n      <!-- 200 -->\n      <g transform=\"translate(7.2 34.340063)scale(0.1 -0.1)\">\n       <use xlink:href=\"#DejaVuSans-50\"/>\n       <use x=\"63.623047\" xlink:href=\"#DejaVuSans-48\"/>\n       <use x=\"127.246094\" xlink:href=\"#DejaVuSans-48\"/>\n      </g>\n     </g>\n    </g>\n   </g>\n   <g id=\"patch_5\">\n    <path d=\"M 33.2875 239.758125 \nL 33.2875 22.318125 \n\" style=\"fill:none;stroke:#000000;stroke-linecap:square;stroke-linejoin:miter;stroke-width:0.8;\"/>\n   </g>\n   <g id=\"patch_6\">\n    <path d=\"M 368.0875 239.758125 \nL 368.0875 22.318125 \n\" style=\"fill:none;stroke:#000000;stroke-linecap:square;stroke-linejoin:miter;stroke-width:0.8;\"/>\n   </g>\n   <g id=\"patch_7\">\n    <path d=\"M 33.2875 239.758125 \nL 368.0875 239.758125 \n\" style=\"fill:none;stroke:#000000;stroke-linecap:square;stroke-linejoin:miter;stroke-width:0.8;\"/>\n   </g>\n   <g id=\"patch_8\">\n    <path d=\"M 33.2875 22.318125 \nL 368.0875 22.318125 \n\" style=\"fill:none;stroke:#000000;stroke-linecap:square;stroke-linejoin:miter;stroke-width:0.8;\"/>\n   </g>\n   <g id=\"text_13\">\n    <!-- Average square feet value between blue and red states -->\n    <g transform=\"translate(33.75625 16.318125)scale(0.12 -0.12)\">\n     <defs>\n      <path d=\"M 34.1875 63.1875 \nL 20.796875 26.90625 \nL 47.609375 26.90625 \nz\nM 28.609375 72.90625 \nL 39.796875 72.90625 \nL 67.578125 0 \nL 57.328125 0 \nL 50.6875 18.703125 \nL 17.828125 18.703125 \nL 11.1875 0 \nL 0.78125 0 \nz\n\" id=\"DejaVuSans-65\"/>\n      <path d=\"M 2.984375 54.6875 \nL 12.5 54.6875 \nL 29.59375 8.796875 \nL 46.6875 54.6875 \nL 56.203125 54.6875 \nL 35.6875 0 \nL 23.484375 0 \nz\n\" id=\"DejaVuSans-118\"/>\n      <path d=\"M 45.40625 27.984375 \nQ 45.40625 37.75 41.375 43.109375 \nQ 37.359375 48.484375 30.078125 48.484375 \nQ 22.859375 48.484375 18.828125 43.109375 \nQ 14.796875 37.75 14.796875 27.984375 \nQ 14.796875 18.265625 18.828125 12.890625 \nQ 22.859375 7.515625 30.078125 7.515625 \nQ 37.359375 7.515625 41.375 12.890625 \nQ 45.40625 18.265625 45.40625 27.984375 \nz\nM 54.390625 6.78125 \nQ 54.390625 -7.171875 48.1875 -13.984375 \nQ 42 -20.796875 29.203125 -20.796875 \nQ 24.46875 -20.796875 20.265625 -20.09375 \nQ 16.0625 -19.390625 12.109375 -17.921875 \nL 12.109375 -9.1875 \nQ 16.0625 -11.328125 19.921875 -12.34375 \nQ 23.78125 -13.375 27.78125 -13.375 \nQ 36.625 -13.375 41.015625 -8.765625 \nQ 45.40625 -4.15625 45.40625 5.171875 \nL 45.40625 9.625 \nQ 42.625 4.78125 38.28125 2.390625 \nQ 33.9375 0 27.875 0 \nQ 17.828125 0 11.671875 7.65625 \nQ 5.515625 15.328125 5.515625 27.984375 \nQ 5.515625 40.671875 11.671875 48.328125 \nQ 17.828125 56 27.875 56 \nQ 33.9375 56 38.28125 53.609375 \nQ 42.625 51.21875 45.40625 46.390625 \nL 45.40625 54.6875 \nL 54.390625 54.6875 \nz\n\" id=\"DejaVuSans-103\"/>\n      <path id=\"DejaVuSans-32\"/>\n      <path d=\"M 14.796875 27.296875 \nQ 14.796875 17.390625 18.875 11.75 \nQ 22.953125 6.109375 30.078125 6.109375 \nQ 37.203125 6.109375 41.296875 11.75 \nQ 45.40625 17.390625 45.40625 27.296875 \nQ 45.40625 37.203125 41.296875 42.84375 \nQ 37.203125 48.484375 30.078125 48.484375 \nQ 22.953125 48.484375 18.875 42.84375 \nQ 14.796875 37.203125 14.796875 27.296875 \nz\nM 45.40625 8.203125 \nQ 42.578125 3.328125 38.25 0.953125 \nQ 33.9375 -1.421875 27.875 -1.421875 \nQ 17.96875 -1.421875 11.734375 6.484375 \nQ 5.515625 14.40625 5.515625 27.296875 \nQ 5.515625 40.1875 11.734375 48.09375 \nQ 17.96875 56 27.875 56 \nQ 33.9375 56 38.25 53.625 \nQ 42.578125 51.265625 45.40625 46.390625 \nL 45.40625 54.6875 \nL 54.390625 54.6875 \nL 54.390625 -20.796875 \nL 45.40625 -20.796875 \nz\n\" id=\"DejaVuSans-113\"/>\n      <path d=\"M 37.109375 75.984375 \nL 37.109375 68.5 \nL 28.515625 68.5 \nQ 23.6875 68.5 21.796875 66.546875 \nQ 19.921875 64.59375 19.921875 59.515625 \nL 19.921875 54.6875 \nL 34.71875 54.6875 \nL 34.71875 47.703125 \nL 19.921875 47.703125 \nL 19.921875 0 \nL 10.890625 0 \nL 10.890625 47.703125 \nL 2.296875 47.703125 \nL 2.296875 54.6875 \nL 10.890625 54.6875 \nL 10.890625 58.5 \nQ 10.890625 67.625 15.140625 71.796875 \nQ 19.390625 75.984375 28.609375 75.984375 \nz\n\" id=\"DejaVuSans-102\"/>\n      <path d=\"M 18.3125 70.21875 \nL 18.3125 54.6875 \nL 36.8125 54.6875 \nL 36.8125 47.703125 \nL 18.3125 47.703125 \nL 18.3125 18.015625 \nQ 18.3125 11.328125 20.140625 9.421875 \nQ 21.96875 7.515625 27.59375 7.515625 \nL 36.8125 7.515625 \nL 36.8125 0 \nL 27.59375 0 \nQ 17.1875 0 13.234375 3.875 \nQ 9.28125 7.765625 9.28125 18.015625 \nL 9.28125 47.703125 \nL 2.6875 47.703125 \nL 2.6875 54.6875 \nL 9.28125 54.6875 \nL 9.28125 70.21875 \nz\n\" id=\"DejaVuSans-116\"/>\n      <path d=\"M 48.6875 27.296875 \nQ 48.6875 37.203125 44.609375 42.84375 \nQ 40.53125 48.484375 33.40625 48.484375 \nQ 26.265625 48.484375 22.1875 42.84375 \nQ 18.109375 37.203125 18.109375 27.296875 \nQ 18.109375 17.390625 22.1875 11.75 \nQ 26.265625 6.109375 33.40625 6.109375 \nQ 40.53125 6.109375 44.609375 11.75 \nQ 48.6875 17.390625 48.6875 27.296875 \nz\nM 18.109375 46.390625 \nQ 20.953125 51.265625 25.265625 53.625 \nQ 29.59375 56 35.59375 56 \nQ 45.5625 56 51.78125 48.09375 \nQ 58.015625 40.1875 58.015625 27.296875 \nQ 58.015625 14.40625 51.78125 6.484375 \nQ 45.5625 -1.421875 35.59375 -1.421875 \nQ 29.59375 -1.421875 25.265625 0.953125 \nQ 20.953125 3.328125 18.109375 8.203125 \nL 18.109375 0 \nL 9.078125 0 \nL 9.078125 75.984375 \nL 18.109375 75.984375 \nz\n\" id=\"DejaVuSans-98\"/>\n      <path d=\"M 4.203125 54.6875 \nL 13.1875 54.6875 \nL 24.421875 12.015625 \nL 35.59375 54.6875 \nL 46.1875 54.6875 \nL 57.421875 12.015625 \nL 68.609375 54.6875 \nL 77.59375 54.6875 \nL 63.28125 0 \nL 52.6875 0 \nL 40.921875 44.828125 \nL 29.109375 0 \nL 18.5 0 \nz\n\" id=\"DejaVuSans-119\"/>\n      <path d=\"M 54.890625 33.015625 \nL 54.890625 0 \nL 45.90625 0 \nL 45.90625 32.71875 \nQ 45.90625 40.484375 42.875 44.328125 \nQ 39.84375 48.1875 33.796875 48.1875 \nQ 26.515625 48.1875 22.3125 43.546875 \nQ 18.109375 38.921875 18.109375 30.90625 \nL 18.109375 0 \nL 9.078125 0 \nL 9.078125 54.6875 \nL 18.109375 54.6875 \nL 18.109375 46.1875 \nQ 21.34375 51.125 25.703125 53.5625 \nQ 30.078125 56 35.796875 56 \nQ 45.21875 56 50.046875 50.171875 \nQ 54.890625 44.34375 54.890625 33.015625 \nz\n\" id=\"DejaVuSans-110\"/>\n     </defs>\n     <use xlink:href=\"#DejaVuSans-65\"/>\n     <use x=\"62.533203\" xlink:href=\"#DejaVuSans-118\"/>\n     <use x=\"121.712891\" xlink:href=\"#DejaVuSans-101\"/>\n     <use x=\"183.236328\" xlink:href=\"#DejaVuSans-114\"/>\n     <use x=\"224.349609\" xlink:href=\"#DejaVuSans-97\"/>\n     <use x=\"285.628906\" xlink:href=\"#DejaVuSans-103\"/>\n     <use x=\"349.105469\" xlink:href=\"#DejaVuSans-101\"/>\n     <use x=\"410.628906\" xlink:href=\"#DejaVuSans-32\"/>\n     <use x=\"442.416016\" xlink:href=\"#DejaVuSans-115\"/>\n     <use x=\"494.515625\" xlink:href=\"#DejaVuSans-113\"/>\n     <use x=\"557.992188\" xlink:href=\"#DejaVuSans-117\"/>\n     <use x=\"621.371094\" xlink:href=\"#DejaVuSans-97\"/>\n     <use x=\"682.650391\" xlink:href=\"#DejaVuSans-114\"/>\n     <use x=\"721.513672\" xlink:href=\"#DejaVuSans-101\"/>\n     <use x=\"783.037109\" xlink:href=\"#DejaVuSans-32\"/>\n     <use x=\"814.824219\" xlink:href=\"#DejaVuSans-102\"/>\n     <use x=\"850.029297\" xlink:href=\"#DejaVuSans-101\"/>\n     <use x=\"911.552734\" xlink:href=\"#DejaVuSans-101\"/>\n     <use x=\"973.076172\" xlink:href=\"#DejaVuSans-116\"/>\n     <use x=\"1012.285156\" xlink:href=\"#DejaVuSans-32\"/>\n     <use x=\"1044.072266\" xlink:href=\"#DejaVuSans-118\"/>\n     <use x=\"1103.251953\" xlink:href=\"#DejaVuSans-97\"/>\n     <use x=\"1164.53125\" xlink:href=\"#DejaVuSans-108\"/>\n     <use x=\"1192.314453\" xlink:href=\"#DejaVuSans-117\"/>\n     <use x=\"1255.693359\" xlink:href=\"#DejaVuSans-101\"/>\n     <use x=\"1317.216797\" xlink:href=\"#DejaVuSans-32\"/>\n     <use x=\"1349.003906\" xlink:href=\"#DejaVuSans-98\"/>\n     <use x=\"1412.480469\" xlink:href=\"#DejaVuSans-101\"/>\n     <use x=\"1474.003906\" xlink:href=\"#DejaVuSans-116\"/>\n     <use x=\"1513.212891\" xlink:href=\"#DejaVuSans-119\"/>\n     <use x=\"1595\" xlink:href=\"#DejaVuSans-101\"/>\n     <use x=\"1656.523438\" xlink:href=\"#DejaVuSans-101\"/>\n     <use x=\"1718.046875\" xlink:href=\"#DejaVuSans-110\"/>\n     <use x=\"1781.425781\" xlink:href=\"#DejaVuSans-32\"/>\n     <use x=\"1813.212891\" xlink:href=\"#DejaVuSans-98\"/>\n     <use x=\"1876.689453\" xlink:href=\"#DejaVuSans-108\"/>\n     <use x=\"1904.472656\" xlink:href=\"#DejaVuSans-117\"/>\n     <use x=\"1967.851562\" xlink:href=\"#DejaVuSans-101\"/>\n     <use x=\"2029.375\" xlink:href=\"#DejaVuSans-32\"/>\n     <use x=\"2061.162109\" xlink:href=\"#DejaVuSans-97\"/>\n     <use x=\"2122.441406\" xlink:href=\"#DejaVuSans-110\"/>\n     <use x=\"2185.820312\" xlink:href=\"#DejaVuSans-100\"/>\n     <use x=\"2249.296875\" xlink:href=\"#DejaVuSans-32\"/>\n     <use x=\"2281.083984\" xlink:href=\"#DejaVuSans-114\"/>\n     <use x=\"2319.947266\" xlink:href=\"#DejaVuSans-101\"/>\n     <use x=\"2381.470703\" xlink:href=\"#DejaVuSans-100\"/>\n     <use x=\"2444.947266\" xlink:href=\"#DejaVuSans-32\"/>\n     <use x=\"2476.734375\" xlink:href=\"#DejaVuSans-115\"/>\n     <use x=\"2528.833984\" xlink:href=\"#DejaVuSans-116\"/>\n     <use x=\"2568.042969\" xlink:href=\"#DejaVuSans-97\"/>\n     <use x=\"2629.322266\" xlink:href=\"#DejaVuSans-116\"/>\n     <use x=\"2668.53125\" xlink:href=\"#DejaVuSans-101\"/>\n     <use x=\"2730.054688\" xlink:href=\"#DejaVuSans-115\"/>\n    </g>\n   </g>\n  </g>\n </g>\n <defs>\n  <clipPath id=\"p0d48ab611a\">\n   <rect height=\"217.44\" width=\"334.8\" x=\"33.2875\" y=\"22.318125\"/>\n  </clipPath>\n </defs>\n</svg>\n",
      "text/plain": [
       "<Figure size 432x288 with 1 Axes>"
      ]
     },
     "metadata": {
      "needs_background": "light"
     },
     "output_type": "display_data"
    }
   ],
   "source": [
    "(\n",
    "    us_median_sq_ft_value\n",
    "    .assign(is_red=lambda x : x.State.isin(red_state_abbrev))\n",
    "    .groupby('is_red')['2017-09']\n",
    "    .mean()\n",
    "    .plot\n",
    "    .bar(title=\"Average square feet value between blue and red states\")\n",
    ");"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "We can now see that the average value of square feet in the blue states (_is_red = False_) is about twice the average of red states (_is_red = True_)"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {},
   "outputs": [],
   "source": []
  }
 ],
 "metadata": {
  "kernelspec": {
   "display_name": "Python 3",
   "language": "python",
   "name": "python3"
  },
  "language_info": {
   "codemirror_mode": {
    "name": "ipython",
    "version": 3
   },
   "file_extension": ".py",
   "mimetype": "text/x-python",
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
   "version": "3.8.3-final"
  }
 },
 "nbformat": 4,
 "nbformat_minor": 2
}