CLIO_SRL

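CLIO_SRL is the SRL (Narrative Semantic Role Labeling) system for a Korean narrative reconstruction project.
This repository covers data preparation, model training, and inference for the SRL task. 👉 It contains the implementation code for the system used in the paper below.
권은낭, 송준모, 서동건, 이강민, 김태욱, 강정한. (2025). 한국어 의미역 결정 모형을 통한 텍스트 내 서사 구조 분석 방법 [A method for analyzing narrative structure in text using a Korean semantic role labeling model].
한국사회학, 59(3), 101-146.
🔗 Go to the paper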

🧐 Pip install

pip install -r requirements.txt

📂 Data Preparation

Download the CLIO SRL dataset_ver2.json file from the following link.
Save the downloaded file in the root directory of this repository.
Run the command below to split the data into training, validation, and test sets:
```
python3 data_split.py
```
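
The contents of data_split.py are not shown here, so the following is only a minimal sketch of what such a split could look like. It assumes the dataset file is a top-level JSON list of records and uses an 80/10/10 split with a fixed seed; the function names `split_records` and `write_splits`, the ratios, and the seed are all hypothetical. The output file names are taken from the directory structure example in this README.

```python
import json
import random
from pathlib import Path


def split_records(records, val_ratio=0.1, test_ratio=0.1, seed=42):
    """Shuffle records deterministically and cut them into train/val/test lists.

    The ratios and seed are assumptions, not values taken from data_split.py.
    """
    shuffled = list(records)
    random.Random(seed).shuffle(shuffled)
    n_val = int(len(shuffled) * val_ratio)
    n_test = int(len(shuffled) * test_ratio)
    val = shuffled[:n_val]
    test = shuffled[n_val:n_val + n_test]
    train = shuffled[n_val + n_test:]
    return train, val, test


def write_splits(source_path, out_dir):
    """Read a JSON list from source_path and write the three split files to out_dir."""
    # Assumes the file holds a top-level JSON list; adjust if the real layout differs.
    records = json.loads(Path(source_path).read_text(encoding="utf-8"))
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    names = ("clio_train.json", "clio_val.json", "clio_test.json")
    for name, part in zip(names, split_records(records)):
        text = json.dumps(part, ensure_ascii=False, indent=2)
        (out_dir / name).write_text(text, encoding="utf-8")


# Example call (not executed here):
# write_splits("CLIO SRL dataset_ver2.json", "data")
```

Passing `ensure_ascii=False` keeps the Korean text readable in the output files instead of escaping it as `\uXXXX` sequences.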

🏋️ Train

Run the shell script below to train the model:

sh train.sh

Note: hyperparameters, paths, and other settings can be changed inside train.sh.

🔍 Inference

Download the inference CSV file from the Google Drive link below:
👉 📥 Download social_line.csv
Rename the file to social_line.csv and save it in the data/ directory.
Then run the shell script below to perform inference:
```
sh inference.sh
```

📁 Example directory structure

.
├── data/
│   ├── CLIO SRL dataset_ver2.json
│   ├── clio_train.json
│   ├── clio_val.json
│   ├── clio_test.json
│   └── social_line.csv
├── data_split.py
├── train.sh
├── inference.sh
└── ...
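
Before running train.sh or inference.sh, it can help to confirm that the files listed above are in place. The snippet below is a small sketch for that check, not part of the repository; the helper name `missing_files` and the `EXPECTED` list are hypothetical and simply mirror the example layout.

```python
from pathlib import Path

# Relative paths copied from the example directory structure in this README.
EXPECTED = [
    "data/clio_train.json",
    "data/clio_val.json",
    "data/clio_test.json",
    "data/social_line.csv",
    "data_split.py",
    "train.sh",
    "inference.sh",
]


def missing_files(root, expected=EXPECTED):
    """Return the expected relative paths that do not exist as files under root."""
    root = Path(root)
    return [rel for rel in expected if not (root / rel).is_file()]


# Example call from the repository root (not executed here):
# print(missing_files("."))
```

An empty list means every file in the example layout was found.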
