Inflearn - Pandas for Data Analysis - Section 4 - Checking the Size of Each Group

르네 · October 13, 2023

These notes summarize the key points from the Inflearn course <Pandas for Data Analysis> (데이터 분석을 위한 판다스).

How to calculate summary statistics
Aggregating statistics

https://pandas.pydata.org/pandas-docs/stable/getting_started/intro_tutorials/06_calculate_statistics.html

Checking the size of each group

titanic['Pclass'].value_counts()
->

3    491
1    216
2    184
Name: Pclass, dtype: int64

Both size and count can be used in combination with groupby. Whereas size includes NaN values and just provides the number of rows (size of the table), count excludes the missing values. In the value_counts method, use the dropna argument to include or exclude the NaN values.

titanic.groupby('Pclass')['Age'].count()

->

Pclass
1    186
2    173
3    355

: count() does not count NULL (NaN) values, so each result is the number of passengers in that class whose Age is recorded.

titanic.groupby('Pclass')['Age'].size()
->
Pclass
1    216
2    184
3    491

: size() counts NULL (NaN) values as well, so each result is the total number of rows in that class.
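
To see the difference without loading the Titanic file, here is a minimal sketch on a made-up table. The `toy` DataFrame and its values are illustrative assumptions, not data from the course. Subtracting count() from size() gives the number of missing Age values per group.

```python
import numpy as np
import pandas as pd

# Hypothetical toy data (not the Titanic dataset): 6 rows, 3 missing ages
toy = pd.DataFrame({
    "Pclass": [1, 1, 2, 3, 3, 3],
    "Age": [38.0, np.nan, 27.0, 22.0, np.nan, np.nan],
})

grouped_age = toy.groupby("Pclass")["Age"]

non_null = grouped_age.count()  # excludes NaN
rows = grouped_age.size()       # includes NaN (plain row count)
missing = rows - non_null       # NaN values per group

print(pd.DataFrame({"count": non_null, "size": rows, "missing": missing}))
```

Applied to the outputs above, the same subtraction gives 216 - 186 = 30 missing ages in class 1, 184 - 173 = 11 in class 2, and 491 - 355 = 136 in class 3.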

value_counts()

titanic['Age'].value_counts().sum()
->
714

: By default, NULL (NaN) values are left out of the counts.

titanic['Age'].value_counts(dropna=False).sum()
-> 891

: With dropna=False, NULL (NaN) values are counted too, so the total equals the number of rows.
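
The same check can be sketched on a small, made-up Series. The `ages` values are an assumption for illustration only. The two totals line up with `Series.count()` and `len()`, and their difference is the number of NaN values.

```python
import numpy as np
import pandas as pd

# Hypothetical ages: 5 entries, 2 of them missing
ages = pd.Series([22.0, 38.0, np.nan, 22.0, np.nan])

without_nan = ages.value_counts().sum()           # NaN excluded (default)
with_nan = ages.value_counts(dropna=False).sum()  # NaN counted as its own value

print(without_nan, ages.count())     # both are the number of non-null entries
print(with_nan, len(ages))           # both are the total number of entries
print(with_nan - without_nan, ages.isna().sum())  # both are the number of NaN
```

On the Titanic data above, this difference is 891 - 714 = 177 passengers without a recorded Age.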
