W12: 5/02 === ### 範例與提示 --- 1.用panda導入資料 ``` import pandas as pd import numpy as np import matplotlib.pyplot as plt #讀取雲端硬碟中的excel檔 dfIncome = pd.read_excel('/content/drive/My Drive/___TestData/001_Income_F.xlsx') dfIncome ``` --- 2.畫出收入與人數直方圖 ``` sum=dfIncome['Income'].sum() n=dfIncome['Income'].count() print(f'總和={sum}, 樣本數={n}') #直方圖 plt.hist(dfIncome['Income'], bins=range(0, 100000, 1000)) plt.show() ``` --- 3.畫出羅倫茲曲線 ``` dfLorenz=dfIncome[:] se=dfLorenz['Income'].sort_values() cumulativeSum=0 i=0 xx=[] yy=[] for x in se: i=i+1 cumulativeSum+=x xx.append(i/n) yy.append(cumulativeSum/sum) plt.plot([0,1], linestyle = 'dotted') plt.scatter(x=xx,y=yy,s=0.1) plt.axis('square') plt.xlim(0,1) plt.ylim(0,1) print('羅倫茲曲線(Lorenz curve)') plt.show() ```