Portuguese Password Wordlist Work

If you are looking for a "deep paper" specifically on the creation and effectiveness of Portuguese-language password wordlists, there isn't a single "standard" academic paper that focuses solely on a wordlist. However, several significant research projects and technical papers address the linguistic nuances of Portuguese in password security. 1. Linguistic & Academic Frameworks

These papers provide the "deep" linguistic data often used to build professional-grade wordlists:

P-AWL: Academic Word List for Portuguese: This research establishes a list of 1,823 entries, systematically contrasting Brazilian and European Portuguese variants. It is used as a foundation for generating high-quality dictionaries by analyzing word frequency and morphological families.

The Portuguese Vocabulary Profile: A pilot study that analyzes learner and native speaker corpora to identify word frequency patterns. This type of frequency analysis is critical for "probabilistic" wordlists, which prioritize common terms over exhaustive dictionaries. 2. Specialized Wordlist Implementations

For technical work, researchers often cite these repositories as the benchmark for Portuguese-specific password behavior:

pt-br-passphrase-wordlist: This project argues that users are moving toward passphrases and provides a massive list of 2.4 million Portuguese/Brazil oriented phrases. It includes Hashcat rules designed to create over 2.5 billion permutations specific to the Brazilian cultural context. portuguese password wordlist work

Dadoware (Brazilian-Portuguese Diceware): Based on the classic Diceware method, this work provides a specific wordlist and methodology for creating human-memorable yet cryptographically strong Portuguese passwords.

SecLists (Language-Specific): The widely-used SecLists repository recently integrated expanded lists for Brazilian Portuguese, citing that standard English lists miss distinct cultural password patterns used by over 130 million Brazilian internet users. 3. Cybersecurity Context in Portugal

If your research is about the effectiveness of these lists in the field:

Portuguese Healthcare Cybersecurity Analysis: This research examines the compliance and security of healthcare systems in Portugal, including password-related vulnerabilities under the NIS2 framework.

Study of Security Issues in Eduroam (Portugal): An analysis of 91 Portuguese institutions that found many users and configurations were vulnerable to dictionary-based attacks due to a lack of security awareness. Comparison of Methods If you are looking for a "deep paper"

For a broader technical deep dive, you might look at "Password Cracking with Brute Force Algorithm and Dictionary Attack", which compares the efficiency of predefined wordlists against brute-force methods using parallel processing on modern GPUs.

Are you focusing on European (PT-PT) or Brazilian (PT-BR) variations for your work?

Add more language-specific wordlists · Issue #1210 - GitHub

3. Enforce accent sensitivity

Many systems silently normalize accents (coraçãocoracao). This is dangerous, as it reduces entropy. Keep accents where possible.

Using a Wordlist

Ethical Warning:

Only use breached data for password analysis on systems you own or have explicit permission to test. Never use personal data from real users for malicious purposes. Password Cracking : Wordlists are commonly used in


Removal of Redundancy:

sort -u huge_wordlist.txt > clean_wordlist.txt

Phase 3: Mutation (Where the real work happens)

The base list is not enough. You must apply mutations. Using Hashcat's --stdout flag, generate permutations.

Common Portuguese mutations:

Command example using Hashcat rule engine:

hashcat --stdout base_clean.txt -r rules/portuguese-fusion.rule > mutated_portuguese.txt

You need a custom rule file (portuguese-fusion.rule). A simple one:

$2 $0 $2 $3   # Append 2023
$1 $9 $9 $0   # Append 1990
d             # Remove accents (if supported)
c             # Capitalize first letter
$!            # Exclamation mark

Append common numbers/symbols

sed 's/$/123/' base_noaccent.txt >> combined.txt sed 's/$/2023/' base_noaccent.txt >> combined.txt