GitHub - hasancanbiyik/text_tools_project: The goal of our project is to identify the parts of speech of certain euphemisms. For example, "pass away" is a verb while "a delicate condition" is an adjective. We can do a lot with this information, such as using it to aid the detection of euphemistic spans. We can also use this information to determine the differences in how euphemisms are used

Topic

Cross-linguistic Analysis of Euphemism Usage and Grammatical Structure in Turkish, English, and Spanish

The goal of our project is to identify the parts of speech of certain euphemisms. This can give interesting insights about both the grammatical structure of the languages as well as cultural influences such as average politeness of a sentence, etc.

The script/commands we plan to use:

Writing a script to eliminate [PET_BOUNDARY] from the data
Using shell tools to grep for certain words
Using tr and sed to convert data to uppercase/lowercase if needed
Deleting @@@@@@@@@@ as seen in the above screenshot
Using wc to do word counts
Or wc -l to do line counts for the data points with a specific property
Outputting our results to a csv file
Removing extraneous punctuation

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
datasets		datasets
english		english
spanish		spanish
stopwords_files		stopwords_files
turkish		turkish
README.md		README.md
references.txt		references.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Topic

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

hasancanbiyik/text_tools_project

Folders and files

Latest commit

History

Repository files navigation

Topic

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages