AI Evaluation Specialist

Status
Hong Kong +1
Full time
Remote
Compensation is not specified
Role
Data Scientist
Description

Binance is a renowned global blockchain ecosystem that operates the largest cryptocurrency exchange platform worldwide, serving over 280 million individuals across 100+ countries. Our solid reputation is built on industry-leading security standards, transparent user fund management, a fast trading engine, extensive liquidity, and an unparalleled array of digital asset products. Binance's diverse offerings span trading, financial services, education, research, payments, institutional support, Web3 features, and more. We harness the potential of digital assets and blockchain technology to establish an inclusive financial ecosystem that promotes financial freedom and enhances financial inclusion on a global scale.

We are currently seeking a dedicated AI Evaluation Specialist tasked with designing, implementing, and overseeing comprehensive evaluation frameworks covering every stage in the lifespan of LLM agents. Your role will play a pivotal part in Binance's AI integration journey by ensuring the dependability, adaptability, and regulatory compliance of AI agents deployed across various domains such as Customer Service, Growth, and Compliance.

Responsibilities:

  • Engage in the complete software development lifecycle, encompassing requirements analysis, test planning, execution, defect tracking, product release, and maintenance.
  • Serve as a primary contact for AI agent evaluation and continuous monitoring.
  • Develop effective test strategies and conduct hands-on testing to guarantee the accuracy, reliability, and performance of AI and data applications.
  • Conduct root cause analysis of test failures and product issues, and facilitate optimization for future enhancements.
  • Design and implement internal tools utilizing AI technology to enhance engineering and testing efficiency.

Requirements:

  • Bachelor’s or Master’s degree in Computer Science, Artificial Intelligence, Data Science, or a related field.
  • Profound understanding of Large Language Models (LLMs), autonomous AI agents, and their system architectures.
  • Experience with AI evaluation methodologies encompassing offline benchmarking, online monitoring, and hybrid human-AI evaluation approaches.
  • Knowledge of software engineering best practices such as Test-Driven Development (TDD) and Behavior-Driven Development (BDD) in AI contexts.
  • Ability to craft adaptive, lifecycle-spanning evaluation frameworks incorporating quantitative and qualitative metrics.
  • Prior experience with evaluation tools and frameworks is beneficial.
  • Proficiency in analyzing complex system-level behaviors, including reasoning pipelines, tool integrations, and agent actions.
  • Strong analytical abilities with a background in data-driven diagnostics and root cause analysis.
  • Excellent communication skills for clear documentation of evaluation plans, results, and recommendations.
  • Experience collaborating in cross-functional teams and managing feedback loops between evaluation and development.
  • Previous exposure to working with infrastructure or platform teams to enhance AI tooling and automation platforms.

Binance offers a dynamic work environment where you can:

  • Play a pivotal role in shaping the future within the foremost blockchain ecosystem globally.
  • Collaborate with top-tier professionals in a user-centric, globally-distributed organization with a flat structure.
  • Engage in unique, fast-paced projects autonomously in an innovative setting.
  • Grow your career and continuously learn within a results-oriented workplace.
  • Enjoy competitive compensation and company benefits.
  • Benefit from remote work arrangements depending on team-specific work requirements.

At Binance, we are committed to fostering an inclusive work environment by promoting diversity within our workforce as we believe it is essential for our continued success. By applying for a job at Binance, you acknowledge that you have reviewed and agreed to our Candidate Privacy Notice.

Company
Binance
Location
United States

More Full-time Jobs

Senior web3 engineer

Treynor, USA
Part time
Remote
About the Project:
We’re building a SocialFi platform combining real estate and DeFi. After launching our MVP, we’re now progressing to version 2.
 
Role:
Join our remote team to develop the website and smart contracts, integrate blockchain features, and ensure platform security.
 
Responsibilities:
- Develop and deploy smart contracts on Ethereum or Layer 2.
- Integrate wallet connect and decentralized features.
- Collaborate with and contribute directly to the frontend/backend teams.
- Conduct security audits.
 
Requirements:
- Experience with React, Ethers.js, Web3.js.
- Familiarity with wallet extensions and DeFi protocols.
- Ability to fix errors within 1 hour during tests.
- Comfortable working independently in a remote setup.
 
Nice to Have:
- Knowledge of cross-chain protocols or cryptography.
- Experience in social finance or real estate blockchain projects.
 
Application:
Send your resume and LinkedIn profile. Selected candidates will do a quick coding test.
 
Contact: contact@hubsai.net
 
Join us to help shape the future of SocialFi!
 
Payment in Crypto
19,000-22,000
Monthly

Recruiter, Marketer, Advertiser, Business Developer

Part time
Remote
We are building a company and are looking for people who want to grow with it.
Open roles
- Marketer
- Business Developer
- Recruiter
- Advertiser
Requirements
- Clear, natural English communication without translators or AI
- Professional, fast, and reliable
- Fluent in your native language
If you want to be part of something early and actually make an impact, reach out.
Payment in Crypto
500-1,000
Monthly

Junior / Middle Metaverse Developer (Unity / Web3 / VR)

Manzhouli, China
Full time
Remote
We are looking for a Metaverse Developer to help build interactive virtual environments and immersive digital experiences. The role involves developing 3D worlds, integrating Web3 technologies, and working with modern tools for real-time applications.
You will work with Unity, WebGL, and blockchain technologies to create engaging metaverse features and optimize them for performance across different devices. The developer will collaborate with designers and backend engineers to deliver scalable and stable products.
Requirements include experience with Unity and C#, understanding of 3D development, Web3 or blockchain basics, and familiarity with Git and API integration. Experience with VR/AR, NFTs, or multiplayer systems is a plus.
We offer remote work, flexible hours, crypto payments, and the opportunity to build innovative metaverse products.
Payment in Crypto
1,500-3,500
Monthly

Web3 Fullstack Developer

Part time
Remote
🌐 About Us
Neonflick is a tech organization specializing in Web3 development.
We believe Web3 solutions are not as popular as they could be — mainly due to complexity. Our mission is to simplify the user experience and make decentralized technology more accessible and easy to use for everyone.
🚀 About the Role
We are looking for a Web3 Fullstack Developer who is passionate about decentralized technologies and excited to contribute to building meaningful Web3 products. You will work on developing new products, improving existing ones, and fixing bugs, all while proposing innovative ideas to enhance our platform.
🔹 Responsibilities
Develop functional and user-friendly Web3 products
Improve and optimize existing products and features
Identify and fix bugs or issues in the system
Propose and implement new ideas for product development
Collaborate with the team to ensure high-quality software delivery
🔹 Requirements
Strong interest in Web3 / blockchain / decentralized technologies
Fullstack development experience (frontend + backend)
Experience with smart contracts and decentralized frameworks is a plus
Problem-solving skills and attention to detail
Ability to work independently and proactively
🔹 What You Get
Early-stage involvement in a Web3 tech organization
Influence over product development and feature design
Opportunity to build experience and a portfolio in the Web3 space
Potential long-term collaboration and future compensation as the project grows
A chance to contribute to making Web3 simpler and more accessible
⚠️ Important
This is an unpaid position. We are looking for someone motivated by vision, learning, and long-term impact rather than immediate salary.
If you are passionate about Web3 and building meaningful decentralized products, we’d love to hear from you.

Senior Blockchain Developer - Ethereum/PulseChain - GAM3S.GG

Lisbon, Portugal
Full time
Remote
About the Job
We're looking for a Senior Blockchain Developer with strong Ethereum/PulseChain smart contract experience to work on our DeFi/GameFi platform. You'll be building and maintaining betting contracts, yield farming mechanisms, token swap systems, and Web3 frontend integrations for our decentralized application on PulseChain.
Key Responsibilities
Smart Contract Development
System Architecture & Integration
Frontend Web3 Integration
Security & Compliance
Performance & Monitoring
Required Technical Skills
Blockchain Expertise
Frontend Web3 Development
Development Tools
Integration & Tools
Payment in Crypto
10,417
Monthly