
The Matching API is a high-throughput similarity search engine that finds related Profiles for a Profile and related Jobs for a Job. Powered by Hiring Superintelligence, it uses HR-native embeddings, multilingual representation, and deterministic ranking to power deduplication, clustering, and look-alike talent discovery without taxonomies or keyword tuning.
{ "score": 0.98, "profile": { "name": "Jane Doe", "similarity": "look-alike" } }
Trusted by Customers, Partners & the AI Ecosystem

Query Profiles with an anchor profile or Jobs with an anchor job, and get ranked hits (JSON) with similarity scores plus pagination metadata (total, page, limit, …). Results are deterministic: same inputs → same order.
Anchor Profile
key or reference, and source
Search Params
(Searching API): keywords, Geo, Ranges, Facets, Taxonomies, Raw filters
Score Threshold
minimum similarity score
Pagination & Sorting
index (source/board), page, limit, sort_by, order_by
1{
2 "code": 200,
3 "message": "Matching completed successfully.",
4 "meta": {
5 "page": 1,
6 "limit": 10,
7 "total": 312
8 },
9 "data": [
10 {
11 "profile": {
12 "key": "abc123",
13 "info": {
14 "first_name": "Sarah",
15 "last_name": "Chen",
16 "location": "San Francisco, CA"
17 },
18 "skills": ["Python", "ML", "NLP"],
19 "experiences_duration": 7.2
20 },
21 "score": 0.97
22 },
23 {
24 "profile": {
25 "key": "def456",
26 "info": {
27 "first_name": "James",
28 "last_name": "Park",
29 "location": "New York, NY"
30 },
31 "skills": ["Data Science", "PyTorch"],
32 "experiences_duration": 5.1
33 },
34 "score": 0.91
35 }
36 ]
37}Trusted by fast-growing HR Tech and Global Enterprise
Cohere gave us cosine similarity, not outcome-based similarity. Two profiles could be equally close in vector space, yet only one consistently succeeds. We kept adding heuristics to compensate.
HrFlow.ai Profiles Matching replaced that with an HR-native Profile encoder trained on real hiring and application signals, so similarity reflects outcomes, not just semantics.
Elasticsearch wasn’t built for Profile→Profile or Job→Job similarity. We stitched together keyword overlap, boosting rules, and RRF, then spent months patching multilingual and seniority edge cases.
HrFlow.ai Matching gave us zero-config, production-ready deterministic similarity with hybrid filters, without the engineering headaches.
Pinecone solved storage and ANN retrieval, but not HR similarity quality. We kept cycling through Hugging Face encoders, managing migrations, and adding heuristics—yet results stayed inconsistent across roles and industries.
HrFlow.ai Matching shipped the full HR-native stack: Profile/Job encoders, deterministic scoring, custom features, and reasoning—so we stopped building the missing layers ourselves.
Cohere gave us cosine similarity, not outcome-based similarity. Two profiles could be equally close in vector space, yet only one consistently succeeds. We kept adding heuristics to compensate.
HrFlow.ai Profiles Matching replaced that with an HR-native Profile encoder trained on real hiring and application signals, so similarity reflects outcomes, not just semantics.
Elasticsearch wasn’t built for Profile→Profile or Job→Job similarity. We stitched together keyword overlap, boosting rules, and RRF, then spent months patching multilingual and seniority edge cases.
HrFlow.ai Matching gave us zero-config, production-ready deterministic similarity with hybrid filters, without the engineering headaches.
Pinecone solved storage and ANN retrieval, but not HR similarity quality. We kept cycling through Hugging Face encoders, managing migrations, and adding heuristics—yet results stayed inconsistent across roles and industries.
HrFlow.ai Matching shipped the full HR-native stack: Profile/Job encoders, deterministic scoring, custom features, and reasoning—so we stopped building the missing layers ourselves.
Cohere gave us cosine similarity, not outcome-based similarity. Two profiles could be equally close in vector space, yet only one consistently succeeds. We kept adding heuristics to compensate.
HrFlow.ai Profiles Matching replaced that with an HR-native Profile encoder trained on real hiring and application signals, so similarity reflects outcomes, not just semantics.
Elasticsearch wasn’t built for Profile→Profile or Job→Job similarity. We stitched together keyword overlap, boosting rules, and RRF, then spent months patching multilingual and seniority edge cases.
HrFlow.ai Matching gave us zero-config, production-ready deterministic similarity with hybrid filters, without the engineering headaches.
Pinecone solved storage and ANN retrieval, but not HR similarity quality. We kept cycling through Hugging Face encoders, managing migrations, and adding heuristics—yet results stayed inconsistent across roles and industries.
HrFlow.ai Matching shipped the full HR-native stack: Profile/Job encoders, deterministic scoring, custom features, and reasoning—so we stopped building the missing layers ourselves.
Cohere gave us cosine similarity, not outcome-based similarity. Two profiles could be equally close in vector space, yet only one consistently succeeds. We kept adding heuristics to compensate.
HrFlow.ai Profiles Matching replaced that with an HR-native Profile encoder trained on real hiring and application signals, so similarity reflects outcomes, not just semantics.
Elasticsearch wasn’t built for Profile→Profile or Job→Job similarity. We stitched together keyword overlap, boosting rules, and RRF, then spent months patching multilingual and seniority edge cases.
HrFlow.ai Matching gave us zero-config, production-ready deterministic similarity with hybrid filters, without the engineering headaches.
Pinecone solved storage and ANN retrieval, but not HR similarity quality. We kept cycling through Hugging Face encoders, managing migrations, and adding heuristics—yet results stayed inconsistent across roles and industries.
HrFlow.ai Matching shipped the full HR-native stack: Profile/Job encoders, deterministic scoring, custom features, and reasoning—so we stopped building the missing layers ourselves.
With open-source encoders, we hit major fairness and compliance risks. A male anchor profile often returned mostly male similar profiles, creating allocation and representation biases we couldn’t justify under GDPR or the EU AI Act.
HrFlow.ai Matching gave us deterministic similarity with built-in fairness controls and HR-safe ingestion.
OpenAI embeddings treated past and recent experience similarly, and missed certifications, seniority fit, and trajectory signals. We didn’t want semantic twins—we wanted hiring twins.
HrFlow.ai Matching delivered Profile→Profile similarity grounded in hiring outcomes, so look-alikes actually behaved like look-alikes in the pipeline.
Google vectors mostly clustered job descriptions. We needed next-apply similarity: jobs a candidate is likely to click and apply to after liking a role.
HrFlow.ai Jobs Matching uses Job encoders trained on application signals, so similar jobs became similar intent, and our engagement and time-to-fill improved without extra tuning.
With open-source encoders, we hit major fairness and compliance risks. A male anchor profile often returned mostly male similar profiles, creating allocation and representation biases we couldn’t justify under GDPR or the EU AI Act.
HrFlow.ai Matching gave us deterministic similarity with built-in fairness controls and HR-safe ingestion.
OpenAI embeddings treated past and recent experience similarly, and missed certifications, seniority fit, and trajectory signals. We didn’t want semantic twins—we wanted hiring twins.
HrFlow.ai Matching delivered Profile→Profile similarity grounded in hiring outcomes, so look-alikes actually behaved like look-alikes in the pipeline.
Google vectors mostly clustered job descriptions. We needed next-apply similarity: jobs a candidate is likely to click and apply to after liking a role.
HrFlow.ai Jobs Matching uses Job encoders trained on application signals, so similar jobs became similar intent, and our engagement and time-to-fill improved without extra tuning.
With open-source encoders, we hit major fairness and compliance risks. A male anchor profile often returned mostly male similar profiles, creating allocation and representation biases we couldn’t justify under GDPR or the EU AI Act.
HrFlow.ai Matching gave us deterministic similarity with built-in fairness controls and HR-safe ingestion.
OpenAI embeddings treated past and recent experience similarly, and missed certifications, seniority fit, and trajectory signals. We didn’t want semantic twins—we wanted hiring twins.
HrFlow.ai Matching delivered Profile→Profile similarity grounded in hiring outcomes, so look-alikes actually behaved like look-alikes in the pipeline.
Google vectors mostly clustered job descriptions. We needed next-apply similarity: jobs a candidate is likely to click and apply to after liking a role.
HrFlow.ai Jobs Matching uses Job encoders trained on application signals, so similar jobs became similar intent, and our engagement and time-to-fill improved without extra tuning.
With open-source encoders, we hit major fairness and compliance risks. A male anchor profile often returned mostly male similar profiles, creating allocation and representation biases we couldn’t justify under GDPR or the EU AI Act.
HrFlow.ai Matching gave us deterministic similarity with built-in fairness controls and HR-safe ingestion.
OpenAI embeddings treated past and recent experience similarly, and missed certifications, seniority fit, and trajectory signals. We didn’t want semantic twins—we wanted hiring twins.
HrFlow.ai Matching delivered Profile→Profile similarity grounded in hiring outcomes, so look-alikes actually behaved like look-alikes in the pipeline.
Google vectors mostly clustered job descriptions. We needed next-apply similarity: jobs a candidate is likely to click and apply to after liking a role.
HrFlow.ai Jobs Matching uses Job encoders trained on application signals, so similar jobs became similar intent, and our engagement and time-to-fill improved without extra tuning.
Integrate 200+ tools with the flip of a switch.
















































HR-native ETL with 200+ connectors plus Webhooks to ingest, normalise, and sync jobs & profiles across your stack—reliable pipelines with unified schemas.
No-code automation platform with 8,000+ app integrations to move data between tools using triggers + actions.
Visual automation platform to extract/transform/route data across 3,000+ apps (plus HTTP modules for any API).
Microsoft Power Automate—workflow automation with 1,000+ API connectors (and support for custom connectors).
Enterprise iPaaS/automation platform with 1,200+ pre-built connectors for orchestrating integrations and data workflows at scale.
Salesforce's low-code workflow automation tool; extended via AppExchange with 7,000+ apps to add integrations and capabilities.
HrFlow.ai Matching is a large-scale search engine for Profile-to-Profiles and Job-to-Jobs similarity search. It combines HR-native deep hierarchical encoders (full career trajectories, job requirements, context, and work environment) with deterministic similarity scoring and a two-stage pipeline (high-throughput retrieval/ANN index + optional refinement) that supports multilingual and cross-lingual representations. It supports hybrid search filters, explainability (Reasoning API), and feature fusion (tags, metadata, tabular signals)—powered by specialized Profile encoders and Job encoders trained with built-in fairness regularization and representation-bias calibration, aligned with the EU AI Act and international ethical AI requirements.
Built for sensitive HR data—secure by default, enterprise-ready.
TLS in transit + encryption at rest to protect documents and extracted data.
Minimal storage by default, with configurable retention policies to match your compliance needs.
Built for sensitive HR data—secure by default, enterprise-ready. AI Act– and GDPR-ready processing, with documented controls for data handling and compliance.
Data processing and storage can be aligned with your required region (e.g., EU or US) depending on your deployment.
| Feature | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Deployment & Trust | |||||||||||
| Headquarters | 🇫🇷 France | 🇺🇸 USA | 🇺🇸 USA | 🇮🇱 Israel | 🇺🇸 USA | ||||||
| 🇺🇸 USA & 🇪🇺 EU Servers | Built-in | config | config | config | |||||||
| GDPR / AI-Act readiness | by design | config | |||||||||
| HR Compliance (Safety & Guardrails) | Built-in | ||||||||||
| Pretraining Data | 1,2B Hiring Signals | Noisy & Biased Web Data | Config (Depends on embedding) | Config (Depends on embedding) | Noisy & Biased Web Data | ||||||
| HR-Focused | |||||||||||
| Input Security (Prompt injection) | |||||||||||
| Pricing model | per record | per input tokens + record (expensive) | per Server (unpredictable) | per input tokens + record (expensive) | per input (expensive) | ||||||
| Vector Database | Built-in | Built-in | Built-in | Built-in | |||||||
| Max Index Storage | >2B records | >2B records | >2B records | >2B records | |||||||
| Refresh Speed (Index update / new record) | ~1s | ~1s | ~1s | ~1s | ~5s | ||||||
| Speed (avg response / 1M records) | ~2s | ~2s | ~1s | ~2s | ~14 days | ||||||
| DevOps burden (production scale) | Lowest | Medium | High | Medium | High | ||||||
| Deployment model | Managed API/Saas | Managed API/ Self-host | Managed API/ Self-host | Managed API/ Self-host | Managed API/ Saas | ||||||
| Core Technology | |||||||||||
| Technology | Deep hierarchical Encoders / Fairness & Bias Optimization | Deep Flat Encoders | Apache Lucene / TF-IDF encoder / Opensource Flat Encoders | Opensource Flat Encoders | Deep Flat Encoders | ||||||
| Multilingual | 43 lang | 23 lang | Config | Config | 40 lang | ||||||
| Crosslingual | |||||||||||
| Match Scores | Outcome-based Similarity | Cosine Similarity | Cosine Similarity / Keywords Overlap / Reciprocal Rank Fusion (RRF) | Cosine Similarity | Cosine Similarity | ||||||
| Hiring Likelihood Profile similarity | Built-in (Profile encoder) | ||||||||||
| Application Likelihood Job similarity | Built-in (Job encoder) | ||||||||||
| White-collar Roles Accuracy | High | Low | Config (Depends on embedding) | Config (Depends on embedding) | Low | ||||||
| Blue-collar Roles Accuracy | High | Low | Config (Depends on embedding) | Config (Depends on embedding) | Low | ||||||
| Junior Roles Accuracy | High | Low | Config (Depends on embedding) | Config (Depends on embedding) | Low | ||||||
| Senior Roles Accuracy | High | Low | Config (Depends on embedding) | Config (Depends on embedding) | Low | ||||||
| Custom Feature Engineering | Built-in (Tags & Metadata) | Keyword boosting + Reciprocal Rank Fusion (RRF) | |||||||||
| Fairness Regularization | Built-in (Constraints) | ||||||||||
| Data Calibration & Debiasing | Built-in (Pipeline) | ||||||||||
| HR Stack integrations (add-ons) | |||||||||||
| Hybrid Search | Built-in (Searching API) | Built-in | Config | Built-in | |||||||
| Reasoning & Explainability | Built-in (Reasoning API) | Built-in (matched keywords) | |||||||||
| Resume, CV, Job parsers | Built-in (Parsing API) | Config | |||||||||
| HR data enrichment & taxonomies | Built-in (Linking/Tagging/Asking APIs) | ||||||||||
| Jobboards / ATS / HCM / HRIS connectors | 200+ connectors (Data Studio) | ||||||||||
| Candidate & Recruiter UI | Widgets (App Studio) | ||||||||||
Everything you need to know about the Matching API
Our APIs are designed to complement each other and unlock your data's full potential
Transform HR documents into structured, enriched Talent & Workforce Data — powering every layer of Hiring Intelligence.
API OverviewUnlock Hiring Superintelligence at scale — with transparent, fair, and explainable ranking across every Talent signal.
API OverviewHrFlow.ai is an API-first company and the leading AI-powered HR data automation platform.
The company helps +1000 customers (HR software vendors, Staffing agencies, large employers, and headhunting firms) to thrive in a high-volume and high-frequency labor market.
The platform provides a complete and fully integrated suite of HR data processing products based on the analysis of hundreds of millions of career paths worldwide -- such as Parsing API, Tagging API, Embedding API, Searching API, Scoring API, and Upskilling API. It also offers a catalog of +200 connectors to build custom scenarios that can automate any business logic.