์ด์์น ํ์ธ ์ค์ตยถ
Inย [ย ]:
import seaborn as sns
import matplotlib.pyplot as plt
import pandas as pd
import numpy as np
Inย [ย ]:
tips = sns.load_dataset('tips')
tips.head(3)
Out[ย ]:
total_bill | tip | sex | smoker | day | time | size | |
---|---|---|---|---|---|---|---|
0 | 16.99 | 1.01 | Female | No | Sun | Dinner | 2 |
1 | 10.34 | 1.66 | Male | No | Sun | Dinner | 3 |
2 | 21.01 | 3.50 | Male | No | Sun | Dinner | 3 |
Inย [ย ]:
tips[['total_bill']].head(3)
Out[ย ]:
total_bill | |
---|---|
0 | 16.99 |
1 | 10.34 |
2 | 21.01 |
1. ESD ์ด์ฉํ ์ด์์น ํ์ธยถ
Inย [ย ]:
mean = np.mean(tips['total_bill'])
std = np.std(tips['total_bill'])
upper_limit = mean + 3 * std
lower_limit = mean - 3 * std
print(upper_limit, ',', lower_limit)
46.43839435626422 , -6.866509110362578
Inย [ย ]:
# ์กฐ๊ฑด ํํฐ๋ง
cond = (tips['total_bill'] > upper_limit)
tips[cond]
Out[ย ]:
total_bill | tip | sex | smoker | day | time | size | |
---|---|---|---|---|---|---|---|
59 | 48.27 | 6.73 | Male | No | Sat | Dinner | 4 |
156 | 48.17 | 5.00 | Male | No | Sun | Dinner | 6 |
170 | 50.81 | 10.00 | Male | Yes | Sat | Dinner | 3 |
212 | 48.33 | 9.00 | Male | No | Sat | Dinner | 4 |
2. IQR์ ์ด์ฉํ ์ด์์น ํ์ธ (boxplot)ยถ
Inย [ย ]:
sns.boxplot(tips['total_bill'])
Out[ย ]:
<Axes: ylabel='total_bill'>
Inย [ย ]:
q1 = tips['total_bill'].quantile(0.25)
q3 = tips['total_bill'].quantile(0.75)
iqr = q3 - q1
upper_limit2 = q3 + 1.5 * iqr
lower_limit2 = q1 - 1.5 * iqr
print(q1, q3, iqr, upper_limit2, lower_limit2)
13.3475 24.127499999999998 10.779999999999998 40.29749999999999 -2.8224999999999945
Inย [ย ]:
# ์กฐ๊ฑด ํํฐ๋ง
cond2 = (tips['total_bill'] > upper_limit2)
tips[cond2]
Out[ย ]:
total_bill | tip | sex | smoker | day | time | size | |
---|---|---|---|---|---|---|---|
59 | 48.27 | 6.73 | Male | No | Sat | Dinner | 4 |
102 | 44.30 | 2.50 | Female | Yes | Sat | Dinner | 3 |
142 | 41.19 | 5.00 | Male | No | Thur | Lunch | 5 |
156 | 48.17 | 5.00 | Male | No | Sun | Dinner | 6 |
170 | 50.81 | 10.00 | Male | Yes | Sat | Dinner | 3 |
182 | 45.35 | 3.50 | Male | Yes | Sun | Dinner | 3 |
184 | 40.55 | 3.00 | Male | Yes | Sun | Dinner | 2 |
197 | 43.11 | 5.00 | Female | Yes | Thur | Lunch | 4 |
212 | 48.33 | 9.00 | Male | No | Sat | Dinner | 4 |
'๐ Python' ์นดํ ๊ณ ๋ฆฌ์ ๋ค๋ฅธ ๊ธ
ํ์ด์ฌ ์ธ์ฝ๋ฉ ์ค์ต (0) | 2024.02.05 |
---|---|
ํ์ด์ฌ ๊ฒฐ์ธก์น ํ์ธ ์ค์ต (0) | 2024.02.05 |
240126 FRI ํ์ด์ฌ ๋ฐ์ดํฐ ๋ถ์ ๊ฐ์ธ ๊ณผ์ (0) | 2024.01.26 |
240125 THU ํ์ด์ฌ์ ํ์ฉํ ์ ์ฒ๋ฆฌ & ์๊ฐํ ๊ฐ์ (1) | 2024.01.25 |
ํ์ด์ฌ ๋ด์ฅํจ์ enumerate()์ for ๋ฐ๋ณต๋ฌธ (0) | 2024.01.10 |