Skip to content

The goal of our project is to identify the parts of speech of certain euphemisms. For example, "pass away" is a verb while "a delicate condition" is an adjective. We can do a lot with this information, such as using it to aid the detection of euphemistic spans. We can also use this information to determine the differences in how euphemisms are used

Notifications You must be signed in to change notification settings

hasancanbiyik/text_tools_project

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

25 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Topic

Cross-linguistic Analysis of Euphemism Usage and Grammatical Structure in Turkish, English, and Spanish

The goal of our project is to identify the parts of speech of certain euphemisms. This can give interesting insights about both the grammatical structure of the languages as well as cultural influences such as average politeness of a sentence, etc.

The script/commands we plan to use:

  • Writing a script to eliminate [PET_BOUNDARY] from the data
  • Using shell tools to grep for certain words
  • Using tr and sed to convert data to uppercase/lowercase if needed
  • Deleting @@@@@@@@@@ as seen in the above screenshot
  • Using wc to do word counts
  • Or wc -l to do line counts for the data points with a specific property
  • Outputting our results to a csv file
  • Removing extraneous punctuation

About

The goal of our project is to identify the parts of speech of certain euphemisms. For example, "pass away" is a verb while "a delicate condition" is an adjective. We can do a lot with this information, such as using it to aid the detection of euphemistic spans. We can also use this information to determine the differences in how euphemisms are used

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 2

  •  
  •