Text Processing Script

This script performs text processing tasks, including sentence segmentation, word tokenization, and analysis of sentence lengths and longest words.

Usage

To use this script, run it from the command line with the path to your text file as an argument:

python script.py path/to/your/text.txt

Features

Sentence Segmentation:

Segments the text into sentences based on full stops and spaces.

Word Tokenization:

Tokenizes words, removing punctuation for analysis.

Sentence Length Analysis:

Orders sentences by length and prints the ordinal position of the six longest sentences.

Longest Words Analysis:

Identifies and prints the five longest unique words in the text.

Dependencies

This script uses Python and the re module for regular expressions.

Contributing

If you'd like to contribute or improve the script, please follow the standard GitHub Fork and Pull Request workflow.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
LICENSE		LICENSE
README.md		README.md
code.py		code.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Text Processing Script

Usage

Features

Sentence Segmentation:

Word Tokenization:

Sentence Length Analysis:

Longest Words Analysis:

Dependencies

Contributing

License

About

Releases

Packages

Languages

License

cristinamatacuta/TextLensAnalyzer

Folders and files

Latest commit

History

Repository files navigation

Text Processing Script

Usage

Features

Sentence Segmentation:

Word Tokenization:

Sentence Length Analysis:

Longest Words Analysis:

Dependencies

Contributing

License

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages