Python Developer – Transition Repetition Analysis Module

120
ETH, DAI, USDT
+53
0 days (till May 25th, 2025)

Milestone 1 – Linguistic QA Validator (French Transition Rules)

🎯 Objective

Develop a Python module to validate batches of AI-generated French transition phrases. This module ensures:
1. No stylistically significant word repetition across transitions in a group
2. “Enfin” is used only in the final transition of each group
3. Grammatical stopwords (like "le", "de", "à", "et") are excluded from repetition checks

📁 Module Target

File: utils/validate_prompt_compliance.py

📚 Definitions

✅ "Repetition" Violation

Flag repeated meaningful words in a group of transitions.
Use a French stopword list to ignore non-stylistic words such as:
["le", "la", "les", "de", "des", "un", "une", "à", "et", "en", "du", "par", "que", "si", "ce", "sur"]

🛑 "Enfin" Misuse

Flag if “enfin” appears in any position other than the last transition in a group.

🧩 Required Functions

tokenize(text: str) -> List[str]: Normalize case, remove punctuation, return word tokens

check_transition_group(transitions: List[str]) -> Dict:
Example return:
{
  "repetition": ["par", "direction"],
  "enfin_misplaced": True
}

validate_batch(batch_outputs: List[List[str]]) -> Dict: Returns summary of violations and per-output breakdown

📤 Output Format (Example)

{
  "total_outputs": 5,
  "outputs_with_violations": 4,
  "violations_summary": {
    "repetition": {
      "count": 3,
      "affected_outputs": [1, 2, 4],
      "violated_words": ["par", "direction", "dans"]
    },
    "enfin_misplaced": {
      "count": 1,
      "affected_outputs": [3]
    }
  },
  "details": [
    {
      "output_id": 1,
      "transitions": ["Par ailleurs,", "Par contre,", "Par exemple,"],
      "violations": {"repetition": ["par"]}
    },
    {
      "output_id": 2,
      "transitions": ["Prenons la direction de Paris,", "Ensuite, prenons la direction de Lyon,", "Enfin, une note sur Marseille"],
      "violations": {"repetition": ["prenons", "direction"]}
    },
    {
      "output_id": 3,
      "transitions": ["Enfin, une annonce importante", "Puis une autre nouvelle", "Pour conclure,"],
      "violations": {"enfin_misplaced": true}
    },
    {
      "output_id": 4,
      "transitions": ["Dans un autre registre,", "Dans la même région,", "Encore dans le domaine économique,"],
      "violations": {"repetition": ["dans"]}
    },
    {
      "output_id": 5,
      "transitions": ["À noter également,", "Nous terminons avec cette info :", "Pour finir,"],
      "violations": {}
    }
  ]
}

✅ Completion Criteria

- tokenize() correctly splits and lowercases all transition text
- Repetition logic excludes stopwords
- enfin_misplaced triggers only when “enfin” is not last
- All outputs match the JSON schema above
- Module is testable and cleanly structured

🧠 Skills Required

- Python 3
- Regex and tokenization
- Set logic and dictionaries
- JSON formatting
- NLP or editorial QA experience (preferred)

120
ETH, DAI, USDT
+53
0 days (till May 25th, 2025)

More Jobs from this customer

🧾 Extract Structured Transition Triplets from DOCX Files

We are hiring a developer to build a Streamlit application that extracts structured examples of transition phrases from .docx documents containing regional French news articles. These transitions are short, context-appropriate phrases that connect ideas between...

🧾 Extract Structured Transition Triplets from DOCX Files

We are hiring a developer to build a Streamlit application that extracts structured examples of transition phrases from .docx documents containing regional French news articles. These transitions are short, context-appropriate phrases that connect ideas between...

More Jobs like this

Show more
Uniswap v3 & v4 Sniper Bot Developer

Looking for an experienced DeFi developer to build a crypto sniper bot on Base blockchain (OP Stack L2) that supports both Uniswap v3 and v4.Strong Solidity + Node.js backgroundFamiliar with Uniswap v3 and Uniswap v4...

Crypto Bot Setup for Token Launch (Confidential)

The tokens were acquired during a presale, and I plan to sell a portion after the launch, expected between September 1–15, 2025. I have some locked tokens I won’t sell. I need assistance configuring the...

Social media marketing

i need instagram and tiktok marketing for my website

Selling 40B $Seconds Time Farm

You can contact me in telegram @Rareape for faster transact

Pro YouTube Editor for Snowball Channel – $20 per Video

🚨 Only for Real Video Editors (Snowball YouTube Channel) We are looking for professional editors only. If you are not experienced → please don’t apply. Channel: Snowball @snowball1 (950+ subs). What we need: 7 YouTube...

Generate a realistic video with Veo3

Generate a video with Veo3, send me a link for verification and if I like it I'll send the payment and you'll send the file.

Technical writer

Job Title: Copywriter – DeFi Location: Remote Type: Contract / Part-time / Full-time About the Role We are seeking a talented Copywriter with strong technical knowledge of DeFi to craft high-impact, engaging, and conversion-driven content...

OnlyFans Promo Campaign – Acquire 250 Subscribers

We are looking for an experienced marketer to run a promo campaign for an OnlyFans profile. The goal is to acquire 250 paying subscribers for a budget of $270. You will manage audience targeting, outreach,...

SEO Website optimization

I have a recently created website that is a blog and provides information about a specfic person. I'm looking for someone to fine tune the site but more importantly, give it into the search engines...

Solana Copy trading Bot/ Auto Detection & Fast Execution

DEVELOPMENT OF AN ULTRA-FAST COPY TRADING BOT ON SOLANA (BLOCK 0 / BLOCK 1) DESCRIPTION: I am looking for an experienced Solana developer with a strong mastery of the Web3 ecosystem, low-latency RPCs, and network optimization, to...

Uniswap v3 & v4 Sniper Bot Developer

Looking for an experienced DeFi developer to build a crypto sniper bot on Base blockchain (OP Stack L2) that supports both Uniswap v3 and v4.Strong Solidity + Node.js backgroundFamiliar with Uniswap v3 and Uniswap v4...

Crypto Bot Setup for Token Launch (Confidential)

The tokens were acquired during a presale, and I plan to sell a portion after the launch, expected between September 1–15, 2025. I have some locked tokens I won’t sell. I need assistance configuring the...

Social media marketing

i need instagram and tiktok marketing for my website

Selling 40B $Seconds Time Farm

You can contact me in telegram @Rareape for faster transact

Pro YouTube Editor for Snowball Channel – $20 per Video

🚨 Only for Real Video Editors (Snowball YouTube Channel) We are looking for professional editors only. If you are not experienced → please don’t apply. Channel: Snowball @snowball1 (950+ subs). What we need: 7 YouTube...

Generate a realistic video with Veo3

Generate a video with Veo3, send me a link for verification and if I like it I'll send the payment and you'll send the file.

Technical writer

Job Title: Copywriter – DeFi Location: Remote Type: Contract / Part-time / Full-time About the Role We are seeking a talented Copywriter with strong technical knowledge of DeFi to craft high-impact, engaging, and conversion-driven content...

OnlyFans Promo Campaign – Acquire 250 Subscribers

We are looking for an experienced marketer to run a promo campaign for an OnlyFans profile. The goal is to acquire 250 paying subscribers for a budget of $270. You will manage audience targeting, outreach,...

SEO Website optimization

I have a recently created website that is a blog and provides information about a specfic person. I'm looking for someone to fine tune the site but more importantly, give it into the search engines...

Solana Copy trading Bot/ Auto Detection & Fast Execution

DEVELOPMENT OF AN ULTRA-FAST COPY TRADING BOT ON SOLANA (BLOCK 0 / BLOCK 1) DESCRIPTION: I am looking for an experienced Solana developer with a strong mastery of the Web3 ecosystem, low-latency RPCs, and network optimization, to...