Python Developer – Transition Repetition Analysis Module

120
ETH, DAI, USDT
+55
0 days (till May 25th, 2025)

Milestone 1 – Linguistic QA Validator (French Transition Rules)

🎯 Objective

Develop a Python module to validate batches of AI-generated French transition phrases. This module ensures:
1. No stylistically significant word repetition across transitions in a group
2. “Enfin” is used only in the final transition of each group
3. Grammatical stopwords (like "le", "de", "à", "et") are excluded from repetition checks

📁 Module Target

File: utils/validate_prompt_compliance.py

📚 Definitions

✅ "Repetition" Violation

Flag repeated meaningful words in a group of transitions.
Use a French stopword list to ignore non-stylistic words such as:
["le", "la", "les", "de", "des", "un", "une", "à", "et", "en", "du", "par", "que", "si", "ce", "sur"]

🛑 "Enfin" Misuse

Flag if “enfin” appears in any position other than the last transition in a group.

🧩 Required Functions

tokenize(text: str) -> List[str]: Normalize case, remove punctuation, return word tokens

check_transition_group(transitions: List[str]) -> Dict:
Example return:
{
  "repetition": ["par", "direction"],
  "enfin_misplaced": True
}

validate_batch(batch_outputs: List[List[str]]) -> Dict: Returns summary of violations and per-output breakdown

📤 Output Format (Example)

{
  "total_outputs": 5,
  "outputs_with_violations": 4,
  "violations_summary": {
    "repetition": {
      "count": 3,
      "affected_outputs": [1, 2, 4],
      "violated_words": ["par", "direction", "dans"]
    },
    "enfin_misplaced": {
      "count": 1,
      "affected_outputs": [3]
    }
  },
  "details": [
    {
      "output_id": 1,
      "transitions": ["Par ailleurs,", "Par contre,", "Par exemple,"],
      "violations": {"repetition": ["par"]}
    },
    {
      "output_id": 2,
      "transitions": ["Prenons la direction de Paris,", "Ensuite, prenons la direction de Lyon,", "Enfin, une note sur Marseille"],
      "violations": {"repetition": ["prenons", "direction"]}
    },
    {
      "output_id": 3,
      "transitions": ["Enfin, une annonce importante", "Puis une autre nouvelle", "Pour conclure,"],
      "violations": {"enfin_misplaced": true}
    },
    {
      "output_id": 4,
      "transitions": ["Dans un autre registre,", "Dans la même région,", "Encore dans le domaine économique,"],
      "violations": {"repetition": ["dans"]}
    },
    {
      "output_id": 5,
      "transitions": ["À noter également,", "Nous terminons avec cette info :", "Pour finir,"],
      "violations": {}
    }
  ]
}

✅ Completion Criteria

- tokenize() correctly splits and lowercases all transition text
- Repetition logic excludes stopwords
- enfin_misplaced triggers only when “enfin” is not last
- All outputs match the JSON schema above
- Module is testable and cleanly structured

🧠 Skills Required

- Python 3
- Regex and tokenization
- Set logic and dictionaries
- JSON formatting
- NLP or editorial QA experience (preferred)

120
ETH, DAI, USDT
+55
0 days (till May 25th, 2025)

More Jobs from this customer

🧾 Extract Structured Transition Triplets from DOCX Files

We are hiring a developer to build a Streamlit application that extracts structured examples of transition phrases from .docx documents containing regional French news articles. These transitions are short, context-appropriate phrases that connect ideas between...

🧾 Extract Structured Transition Triplets from DOCX Files

We are hiring a developer to build a Streamlit application that extracts structured examples of transition phrases from .docx documents containing regional French news articles. These transitions are short, context-appropriate phrases that connect ideas between...

More Jobs like this

Show more
Buy second

Buying One Billion Seconds: Investing in the Time Farm Economy"   In a world where time is the most precious and non-renewable asset, imagine if you could buy it — not metaphorically, but literally. Welcome...

Upwork, freelancer seller is here. https://t.me/Nadir53454

Upwork, freelancer seller is here. https://t.me/Nadir53454

Full Automation Tool Request

🎯 Purpose To develop a local automation application that registers specified phone numbers on a list of websites, where completion is defined as receiving the SMS verification message. 🔧 Feature Requirements Automated SMS Verification Registration...

I need to sell freelancer accounts. t.me/Nadir53454

I need to sell freelancer accounts. https://t.me/Nadir53454

WFH Data Contributors

Work Location: Remote/Work-from-home within the United States Engagement Model: Independent Contractor / Freelance Estimated Start Date: June 2025 Estimated Project Duration: 6 weeks Hourly Throughput: 30 screenshots Preferred availability: Minimum 10 hours per week Languages...

Develop an adroid App in Firebase Studio

For now the payment consists of small task packs worth $5 ETH each, and the payment will be by MetaMask Requirements: Dart Contact me on Telegram: t.me/LesterRF Please send me this emoji: 👨‍💻

Need Vietnamese Speaker to Make Local Phone Calls in Vietnam

I'm looking for a Vietnam-based person who speaks fluent Vietnamese and can make local phone calls within Vietnam. Your task will be to: Call specific organizations or companies (list provided), Politely deliver useful information to...

transitions job

We are seeking a talented individual to join our team as a Transitions Job. This position requires strong writing and translation skills, particularly in Portuguese, Spanish, and Hindi. The ideal candidate will have a passion...

English to Italian Document Translation

We are looking for a highly skilled translator to help us convert two documents from English to Italian. The ideal candidate will have strong proficiency in both languages, along with experience translating formal and professional...

Translate From English to Cantonese/Chinese

We are seeking a skilled and detail-oriented Cantonese Translator to join our team. The ideal candidate will be responsible for translating written documents from English to Cantonese with high accuracy, maintaining the original meaning, tone,...

Buy second

Buying One Billion Seconds: Investing in the Time Farm Economy"   In a world where time is the most precious and non-renewable asset, imagine if you could buy it — not metaphorically, but literally. Welcome...

Upwork, freelancer seller is here. https://t.me/Nadir53454

Upwork, freelancer seller is here. https://t.me/Nadir53454

Full Automation Tool Request

🎯 Purpose To develop a local automation application that registers specified phone numbers on a list of websites, where completion is defined as receiving the SMS verification message. 🔧 Feature Requirements Automated SMS Verification Registration...

I need to sell freelancer accounts. t.me/Nadir53454

I need to sell freelancer accounts. https://t.me/Nadir53454

WFH Data Contributors

Work Location: Remote/Work-from-home within the United States Engagement Model: Independent Contractor / Freelance Estimated Start Date: June 2025 Estimated Project Duration: 6 weeks Hourly Throughput: 30 screenshots Preferred availability: Minimum 10 hours per week Languages...

Develop an adroid App in Firebase Studio

For now the payment consists of small task packs worth $5 ETH each, and the payment will be by MetaMask Requirements: Dart Contact me on Telegram: t.me/LesterRF Please send me this emoji: 👨‍💻

Need Vietnamese Speaker to Make Local Phone Calls in Vietnam

I'm looking for a Vietnam-based person who speaks fluent Vietnamese and can make local phone calls within Vietnam. Your task will be to: Call specific organizations or companies (list provided), Politely deliver useful information to...

transitions job

We are seeking a talented individual to join our team as a Transitions Job. This position requires strong writing and translation skills, particularly in Portuguese, Spanish, and Hindi. The ideal candidate will have a passion...

English to Italian Document Translation

We are looking for a highly skilled translator to help us convert two documents from English to Italian. The ideal candidate will have strong proficiency in both languages, along with experience translating formal and professional...

Translate From English to Cantonese/Chinese

We are seeking a skilled and detail-oriented Cantonese Translator to join our team. The ideal candidate will be responsible for translating written documents from English to Cantonese with high accuracy, maintaining the original meaning, tone,...