latest news

This website brings you the latest news.

Thursday, 27 February 2025

New top story on Hacker News: Show HN: Ranked Search for Semi-Structured Data

Posted by Champ at 11:21 in Hacker News

Show HN: Ranked Search for Semi-Structured Data
7 by alrudolph | 0 comments on Hacker News.
We’ve been working on a search problem that requires querying both text and numbers simultaneously. For example, in a dataset of clothing items with descriptions and prices, a search for “slim pants for $20” should rank skinny jeans for $25 above slim pants for $50, because the descriptions are semantically similar and the price is closer.

I’ve found that standard embedding models struggle with numerical ordering, while text-to-SQL methods rely on exact matches and often filter out too many results. To solve this, we built a system designed specifically for structured datasets like CSVs or tables. Here’s a demo link where you can upload a small CSV to try it out (no login required): https://ift.tt/F51MQlB .

Unlike most RAG approaches, we process each column independently, handling text with embeddings and numbers with custom scoring. When a user submits a query, we parse it into relevant fields, for instance extracting “slim pants” as the description and “20” as the price. We then compute the cosine similarity between the description embeddings and “slim pants”, and calculate the percent error between the user’s price input and the numerical field. These individual similarity scores are then combined across all columns to generate a final ranking.

Right now, our system works best with well-structured data, so some preprocessing is often needed. We’re working on improving this by detecting and restructuring messy data automatically, such as pivoting columns or extracting attributes from large text fields. We’re also adding feedback mechanisms, like a thumbs up/down system, to refine future search results based on user input.

I’d love to hear about your experiences with similar search challenges and would appreciate any feedback!
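The per-column scoring described above could be sketched roughly as follows. This is only an illustration under stated assumptions, not the system's actual code: `embed` is a toy bag-of-words stand-in for a real embedding model (so it only matches shared words and would not link "skinny jeans" to "slim pants"), `parse_query` is a hypothetical regex-based parser, the percent error is turned into a similarity with `1 - error` clipped at zero, and the column scores are combined with an unweighted mean. The post does not say how the real system does any of these.

```python
import math
import re
from collections import Counter

# Hypothetical pattern: an optional "for" followed by a dollar amount.
PRICE = re.compile(r"\s*(?:for\s+)?\$(\d+(?:\.\d+)?)")


def embed(text):
    """Toy stand-in for an embedding model: a bag-of-words count vector."""
    return Counter(re.findall(r"[a-z]+", text.lower()))


def cosine_similarity(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def numeric_score(target, value):
    """Assumed mapping from percent error to a similarity in [0, 1]."""
    if target == 0:
        return 1.0 if value == 0 else 0.0
    percent_error = abs(value - target) / abs(target)
    return max(0.0, 1.0 - percent_error)


def parse_query(query):
    """Split a query into (description, price); price is None if absent."""
    match = PRICE.search(query)
    price = float(match.group(1)) if match else None
    description = PRICE.sub("", query, count=1).strip()
    return description, price


def rank(rows, query):
    """Score each row column by column, then sort by the mean score."""
    description, price = parse_query(query)
    query_vec = embed(description)
    scored = []
    for row in rows:
        scores = [cosine_similarity(query_vec, embed(row["description"]))]
        if price is not None:
            scores.append(numeric_score(price, row["price"]))
        # Unweighted mean across columns: an assumption, not the post's method.
        scored.append((sum(scores) / len(scores), row))
    return sorted(scored, key=lambda pair: pair[0], reverse=True)


# Made-up sample data for illustration.
items = [
    {"description": "slim pants", "price": 50.0},
    {"description": "slim fit pants", "price": 25.0},
    {"description": "wool sweater", "price": 20.0},
]

for score, row in rank(items, "slim pants for $20"):
    print(f"{score:.3f}  {row['description']}  ${row['price']:.0f}")
```

With this scoring, a close but inexact text match at a nearby price can outrank an exact text match at a distant price, which is the behavior the post is after. Swapping `embed` for a real embedding model and tuning per-column weights would be the obvious next steps.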


