제 1 부 자료의 시각화
제 1 장 자료의 시각화 ······························································································· 3
1. 왜 자료의 시각화인가? ·········································································· 4
2. 자료의 시각화의 목적은 무엇인가? ·················································· 11
1) 자료를 요약 · 정리하기 위한 목적 _ 11
2) 적절한 통계 방법론 적용을 위한 시각화 _ 14
3) 효율적인 소통을 위한 시각화 _ 18
3. 자료의 시각화 방법과 오류 ································································ 21
1) 규모가 다른 그룹 간 차이를 반영하지 못한 시각화 _ 22
2) 잘못된 축의 사용에 따른 시각화의 문제점 _ 24
3) 입체화에 따른 정확성 상실 _ 25
4. 자료의 시각화를 위한 그래프에는 어떤 것들이 있는가? ············· 27
제 2 장 SAS의 소개 ································································································· 33
1. SAS를 이용한 자료 시각화의 접근방법 ··········································· 34
2. SAS에 대한 오해와 SAS의 장점 ······················································· 39
3. SAS 프로그램의 기본 구조: DATA 스텝과 PROC 스텝 ············· 43
1) SAS 프로그램의 기본 구조 _ 43
2) 데이터 스텝의 기본 형식 _ 46
3) PROC 스텝의 기본 형식 _ 47
제 3 장 SAS에서의 자료 입출력 및 자료운영 ················································· 53
1. 자료의 입출력 ······················································································ 54
1) 엑셀 자료의 가져오기와 보내기 _ 54
2) CSV 파일 불러오기 _ 57
3) SAS에서 자료를 직접 입력하기 _ 58
4) SAS 자료의 출력 전달 시스템 _ 63
2. 교차표 형식의 자료 입력 ···································································· 68
3. 변수변환 ·································································································· 77
1) 사칙연산 및 연산자를 이용한 변수변환 _ 78
2) 연속형 변수와 범주형 변수 간의 변환 _ 82
4. 변수 및 변수 값에 레이블 붙이기 ···················································· 86
1) 데이터 세트와 변수에 설명 붙이기: LABEL 명령 _ 89
2) 변수 값에 설명 붙이기: FORMAT 명령 _ 91
3) 엑셀 입력 자료를 통한 레이블과 포맷 지정 _ 94
4) 레이블과 포맷 정보 지우기 _ 100
5) 기타 포맷과 관련된 특수문제들 _ 101
5. 횡형 자료와 종형 자료 간의 변형 ·················································· 103
1) 자료 전치 방법을 이용한 변환: PROC TRANSPOSE _ 104
2) OUTPUT 문을 이용한 횡형 자료를 종형 자료로 변환하기 _ 108
3) ARRAY 문을 이용한 변형 방법 _ 110
4) ARRAY를 이용한 결측치 처리 _ 120
5) ARRAY 함수의 기타 용법 _ 122
6. 축차변수 만들기 ·················································································· 126
7. 자료의 정렬과 결합 ············································································ 130
1) 자료의 정렬: PROC SORT _ 130
2) 자료의 수평결합: MERGE _ 132
3) 자료의 수직결합: SET _ 135
제 2 부 자료 시각화를 위한 기본 그래프 유형
제 4 장 시각화를 위한 기본 그래프 ·································································· 141
1. SAS의 통계 그래프의 기본 유형 ··················································· 142
1) 통계 그래프의 유형 _ 142
2) 기본 그래프의 유형 _ 147
2. 기본 그래프의 구현과 응용 ······························································ 151
1) 막대그래프 _ 151
2) 다양한 막대그래프 응용 사례 _ 164
3) 선 그래프 _ 168
4) 상자그림 _ 178
5) 산점도 _ 183
6) 누적 막대그래프와 밴드 그래프 _ 188
3. PROC SGPANEL을 이용한 시각화 ················································· 195
4. PROC SGSCATTER를 이용한 시각화 ············································ 197
5. GTL을 이용한 시각화 ········································································ 199
1) GTL의 기본 구조 _ 200
2) GTL을 이용한 통계 그래프 그리기 _ 205
6. 다양한 자료 시각화 예시들 ······························································ 217
제 3 부 통계분석의 시각화
제 5 장 기술통계분석을 위한 시각화 ································································· 231
1. 범주형 변수 분포의 시각화 ······························································ 233
1) 단일 범주형 변수의 분포 _ 233
2) 두 개의 서열형 변수의 시각화 _ 246
3) 범주형 변수와 연속형 변수의 시각화 _ 251
2. 연속형 변수 분포의 시각화 ······························································ 260
1) 단일 변수 분포의 시각화 _ 260
2) 시계열 변수 추세의 시각화 _ 268
3) 두 연속형 변수 관계의 시각화 _ 277
제 6 장 그룹 간 차이분석과 분산분석 결과의 시각화 ································· 283
1. 두 집단 간의 차이 검정의 시각화 ·················································· 285
2. 분산분석을 이용한 집단 간의 평균 차이 분석 ····························· 288
3. 그룹 간 평균 차이의 시각화 ···························································· 292
1) 전체 평균과 개별 그룹 평균의 차이 분석 _ 293
2) 전체 그룹 비율과 개별 그룹 비율의 차이 분석 _ 296
3) 단위시간당 사건 발생 횟수의 전체 그룹과 개별 그룹의
차이 분석 _ 298
제 7 장 회귀분석 결과의 시각화 ········································································· 303
1. 산점도 분석을 통한 독립변수와 종속변수의 특성 파악하기 ······ 305
2. 모형적합도의 판단을 위한 시각화 ··················································· 311
3. 회귀계수의 시각화 ·············································································· 312
4. 회귀가정 검토를 위한 시각화 ·························································· 320
1) 잔차를 이용한 극단값의 확인 _ 320
2) 잔차분석을 이용한 이분산성 및 선형성 확인 _ 322
3) 영향점과 이상점의 검출 _ 325
제 8 장 로지스틱 회귀분석 결과의 시각화 ······················································ 329
1. 분석 예제 자료 ···················································································· 330
2. 모형적합도 ···························································································· 332
3. 추정된 회귀계수의 시각화 ································································ 337
4. 한계효과를 이용한 확률 변화의 크기 해석 ··································· 342
5. 로지스틱 회귀분석에서의 상호작용 효과 ······································· 345
제 4 부 공간정보의 시각화
제 9 장 공간정보의 시각화 개요 ······································································· 359
1. 왜 공간정보 시각화인가? ·································································· 361
2. 공간좌표 구하기 ················································································ 362
3. 지리정보 가져오기 ·············································································· 366
4. GEOCODING의 개념 ·········································································· 370
제10장 공간정보를 이용한 시각화 ···································································· 373
1. GMAP을 이용한 시각화 ···································································· 374
2. 엑셀에 저장된 속성자료와 SAS 공간자료를 이용한 분석 ·········· 390
3. 외부 공간정보 자료를 가져와 지도 그리기 ··································· 392
4. 인터넷상의 지리정보를 직접 활용하기 ··········································· 408
제11장 자료 시각화의 발전 방향 ······································································ 415
부록: SAS UNIVERSITY EDITION 설치법 _ 419
찾아보기 _ 431