[PAPER REVIEW 231228] NLP review paper

2023. 12. 28. 03:21·Paper Review

Natural language processing: state of the art, current trends and challenges

Jul. 2022

 

1. NLP

 

1) Natural Language Understanding (NLU) = Linguistics

 

(1) Phonology - sound

 

(2) Morphology - the smallest units of meaning

   e.g., precancellation -> pre (prefix), cancella (root), -tion (suffix)

   a. Lexical morpheme (e.g., table, chair)

   b. Grammatical morphemes (e.g., Worked, Consulting)

   c. Bound morphemes (e.g., -ed, -ing)

      c-1. inflectional morphemes: change the different grammatical categories

      c-2. derivational morphemes: change the semantic meaning of the word

 

(3) Lexical

   a. part-of-speech

   b. stemming: remove the suffix

   c. lemmatization: correct basic form

 

(4) Syntax - sentence structure

   -> doesn't support stemming or lematization

 

(5) Semantics - literal meaning

(6) Pragmatics - inferred meaning

   e.g., "Do you know what time is it?"

   Semantics: "Asking for the current time"

   Pragmatics: "Expressing resentment to someone"

 

2) Natural Language Generation (NLG)

 

(1) Components and Levels of Representaiton

   a. Content selection

   b. Textual Organization

   c. Linguistic Resources

   d. Realization

 

2. NLP tasks

1) Automatic Summarization

 

2) Co-Reference Resolution - Find words used in different ways to describe an arbitrary entity and connect them to the same entity

 

3) Discourse Analysis - chat data

 

4) Machine translation

 

5) Morphological Segmentation - breaking words into individual meaning-bearing morphemes

 

6) Named entity recognition (NER)

 

7) Optical Character Recognition

 

8) Part Of Speech Tagging

 

3. Datasets

1) Language Modelling

(1) Salesforce's WikiText-103: 103 million tokens

(2) WikiText-2: 2 million tokens

(3) Penn Treebank piece of the Wall Street Diary corpus: 929,000

'Paper Review' 카테고리의 다른 글
  • [PAPER REVIEW 231231] UAE
  • [PAPER REVIEW 231231] emoji2vec
  • [PAPER REVIEW 231226] DialogueRNN
  • [PAPER REVIEW 231221] Recent Trends in Sentiment Analysis and Emotion Detection
Sungyeon Kim
Sungyeon Kim
goldstaryeon@sookmyung.ac.kr
Sungyeon Kim
Sungyeon Kim
Sungyeon Kim
전체
오늘
어제
  • 분류 전체보기 (610) N
    • Paper Review (30)
    • Research Record (9)
    • Study Record (143)
      • Cybersecurity (79)
      • AI Data Science (28)
      • Computer Science (24)
      • Linear Algebra (6)
      • SQL (5)
      • LaTeX (1)
    • English Transcription (256)
    • 한글 필사 (96) N
    • 날것 그대로의 생각들 (72)

인기 글

최근 댓글

최근 글

hELLO· Designed By정상우.v4.5.3
Sungyeon Kim
[PAPER REVIEW 231228] NLP review paper
상단으로

티스토리툴바

단축키

내 블로그

내 블로그 - 관리자 홈 전환
Q
Q
새 글 쓰기
W
W

블로그 게시글

글 수정 (권한 있는 경우)
E
E
댓글 영역으로 이동
C
C

모든 영역

이 페이지의 URL 복사
S
S
맨 위로 이동
T
T
티스토리 홈 이동
H
H
단축키 안내
Shift + /
⇧ + /

* 단축키는 한글/영문 대소문자로 이용 가능하며, 티스토리 기본 도메인에서만 동작합니다.