星期日, 22 6 月, 2025
ZKE News
  • Home
  • Live Crypto Prices
  • Crypto News
    • Bitcoin
    • Altcoins
  • NFT News
  • Blockchain
  • Regulations
  • Scams
No Result
View All Result
  • Home
  • Live Crypto Prices
  • Crypto News
    • Bitcoin
    • Altcoins
  • NFT News
  • Blockchain
  • Regulations
  • Scams
No Result
View All Result
ZKE News
No Result
View All Result

Optimizing IVF-PQ Performance with RAPIDS cuVS: Key Tuning Techniques

by NZU
19 7 月, 2024
in Blockchain
0
Optimizing IVF-PQ Performance with RAPIDS cuVS: Key Tuning Techniques

Related articles

Crypto & NFT Data Tracker CoinMarketCap Got Hacked

Crypto & NFT Data Tracker CoinMarketCap Got Hacked

21 6 月, 2025
Bored Ape NFT Creator Yuga Labs Drops NFT Stickers On TON

Bored Ape NFT Creator Yuga Labs Drops NFT Stickers On TON

21 6 月, 2025


Tony Kim
Jul 18, 2024 19:39

Learn how to optimize the IVF-PQ algorithm for vector search performance using RAPIDS cuVS, with practical tips on tuning hyper-parameters and improving recall.





In the first part of the series, an overview of the IVF-PQ algorithm was presented, explaining its foundation on the IVF-Flat algorithm and the use of Product Quantization (PQ) to compress the index and support larger datasets. In part two, the focus shifts to the practical aspects of tuning IVF-PQ performance, which is crucial for achieving optimal results, especially when dealing with billion-scale datasets.

Tuning Parameters for Index Building

IVF-PQ shares some parameters with IVF-Flat, such as coarse-level indexing and search hyper-parameters. However, IVF-PQ introduces additional parameters that control compression. One of the critical parameters is n_lists, which determines the number of partitions (inverted lists) into which the input dataset is clustered. The performance is influenced by the number of lists probed and their sizes. Experiments suggest that n_lists in the range of 10K to 50K yield good performance across recall levels, though this can vary depending on the dataset.

Another crucial parameter is pq_dim, which controls compression. Starting with one fourth the number of features in the dataset and increasing in steps is a good technique for tuning this parameter. Figure 2 in the original blog post illustrates significant drops in QPS, which can be attributed to factors such as increased compute work and shared memory requirements per CUDA block.

The pq_bits parameter, ranging from 4 to 8, controls the number of bits used in each individual PQ code, affecting the codebook size and recall. Reducing pq_bits can improve search speed by fitting the look-up table (LUT) in shared memory, although this comes at the cost of recall.

Additional Parameters

The codebook_kind parameter determines how the codebooks for the second-level quantizer are constructed, either for each subspace or for each cluster. The choice between these options can impact training time, GPU shared memory utilization, and recall. Parameters such as kmeans_n_iters and kmeans_trainset_fraction are also important, though they rarely need adjustment.

Tuning Parameters for Search

The n_probes parameter, discussed in the previous blog post on IVF-Flat, is essential for search accuracy and throughput. IVF-PQ provides additional parameters like internal_distance_dtype and lut_dtype, which control the representation of distance or similarity during the search and the datatype used to store the LUT, respectively. Adjusting these parameters can significantly impact performance, especially for datasets with large dimensionality.

Improving Recall with Refinement

When tuning parameters is not enough to achieve the desired recall, refinement offers a promising alternative. This separate operation, performed after the ANN search, recomputes exact distances for selected candidates and reranks them. The refinement operation can significantly improve recall, as demonstrated in Figure 4 of the original blog post, though it requires access to the source dataset.

Summary

The series on accelerating vector search with inverted-file indexes covers two cuVS algorithms: IVF-Flat and IVF-PQ. IVF-PQ extends IVF-Flat with PQ compression, enabling faster searches and the ability to handle billion-scale datasets with limited GPU memory. By fine-tuning parameters for index building and search, data practitioners can achieve the best results efficiently. The RAPIDS cuVS library offers a range of vector search algorithms to cater to various use cases, from exact searches to low-accuracy-high-QPS ANN methods.

For practical tuning of IVF-PQ parameters, refer to the IVF-PQ notebook on GitHub. For more details on the provided APIs, see the cuVS documentation.

Image source: Shutterstock


Credit: Source link

Previous Post

Basel Committee releases final disclosure framework for banks’ crypto exposures

Next Post

Analyst Says Altcoin That’s Up Over 120% in Two Weeks Primed for Another Leg Up, Updates Outlook on Shiba Inu

Related Posts

Crypto & NFT Data Tracker CoinMarketCap Got Hacked

Crypto & NFT Data Tracker CoinMarketCap Got Hacked

21 6 月, 2025

In recent months, th...

Bored Ape NFT Creator Yuga Labs Drops NFT Stickers On TON

Bored Ape NFT Creator Yuga Labs Drops NFT Stickers On TON

21 6 月, 2025

Join Our Telegram ch...

Bitcoin (BTC) Market Evolution: Institutional Influence and Sovereign Reserves

Bitcoin (BTC) Market Evolution: Institutional Influence and Sovereign Reserves

20 6 月, 2025

Ja...

Character.AI Appoints Karandeep Anand as New CEO

Character.AI Appoints Karandeep Anand as New CEO

20 6 月, 2025

Je...

Enhancements in Bitcoin’s Initial Block Download (IBD) Process

Enhancements in Bitcoin’s Initial Block Download (IBD) Process

20 6 月, 2025

Jo...

Load More

发表回复 取消回复

您的邮箱地址不会被公开。 必填项已用 * 标注

TAO & SHIB Analysis Stalls; Web3 Ai Hits $8.3M As Traders Shift Focus

TAO & SHIB Analysis Stalls; Web3 Ai Hits $8.3M As Traders Shift Focus

22 6 月, 2025
Russia Paid Teenage Spies in Bitcoin, Blockchain Forensics Reveal

Russia Paid Teenage Spies in Bitcoin, Blockchain Forensics Reveal

15 6 月, 2025
Trump Family Quietly Decreases Stake In World Liberty Financial

Trump Family Quietly Decreases Stake In World Liberty Financial

20 6 月, 2025
Optimizing AI Retrieval: Choosing the Best Chunking Strategy

Optimizing AI Retrieval: Choosing the Best Chunking Strategy

19 6 月, 2025
Bitcoin treasury hype will burst like memecoin supercycle – Crypto VC

Bitcoin treasury hype will burst like memecoin supercycle – Crypto VC

18 6 月, 2025

ZKE NEWS

ZKE News is an online news source that provides the latest updates on crypto news, including Bitcoin, Altcoin, Blockchain, NFT news, crypto regulation, scams, and much more.

Categories

  • Altcoins
  • Bitcoin
  • Blockchain
  • Crypto News
  • NFT News
  • Regulations
  • Scams

Tags

Altcoins Bitcoin Blockchain Crypto News NFT News Regulations Scams
  • Contact Us
  • Privacy Policy
  • Terms of Use
  • DMCA

© 2023 - news.zke.us - All Rights Reserved!

No Result
View All Result
  • Home
  • Live Crypto Prices
  • Crypto News
    • Bitcoin
    • Altcoins
  • NFT News
  • Blockchain
  • Regulations
  • Scams

© 2018 JNews by Jegtheme.

  • bitcoinBitcoin(BTC)$57,792.00-0.07%
  • ethereumEthereum(ETH)$3,102.631.60%
  • tetherTether(USDT)$1.00-0.06%
  • binancecoinBNB(BNB)$522.791.33%
  • solanaSolana(SOL)$141.960.36%
  • usd-coinUSDC(USDC)$1.000.04%
  • staked-etherLido Staked Ether(STETH)$3,109.901.85%
  • rippleXRP(XRP)$0.4379300.51%
  • ToncoinToncoin(TON)$7.21-1.35%
  • dogecoinDogecoin(DOGE)$0.1074050.22%
  • cardanoCardano(ADA)$0.3837042.22%
  • tronTRON(TRX)$0.1312550.99%
  • avalanche-2Avalanche(AVAX)$25.71-1.89%
  • shiba-inuShiba Inu(SHIB)$0.0000160.46%
  • wrapped-bitcoinWrapped Bitcoin(WBTC)$57,635.00-0.37%
  • polkadotPolkadot(DOT)$6.120.47%
  • chainlinkChainlink(LINK)$12.75-0.39%
  • bitcoin-cashBitcoin Cash(BCH)$337.722.22%
  • uniswapUniswap(UNI)$8.060.22%
  • leo-tokenLEO Token(LEO)$5.82-0.47%
  • daiDai(DAI)$1.00-0.17%
  • nearNEAR Protocol(NEAR)$4.601.94%
  • litecoinLitecoin(LTC)$66.672.01%
  • matic-networkPolygon(MATIC)$0.512.19%
  • Wrapped eETHWrapped eETH(WEETH)$3,226.781.47%
  • KaspaKaspa(KAS)$0.170006-0.24%
  • PepePepe(PEPE)$0.0000091.70%
  • Ethena USDeEthena USDe(USDE)$1.000.08%
  • internet-computerInternet Computer(ICP)$7.18-0.70%
  • Renzo Restaked ETHRenzo Restaked ETH(EZETH)$3,141.781.50%
  • ethereum-classicEthereum Classic(ETC)$20.921.45%
  • fetch-aiArtificial Superintelligence Alliance(FET)$1.19-0.56%
  • moneroMonero(XMR)$156.200.91%
  • AptosAptos(APT)$6.111.91%
  • stellarStellar(XLM)$0.087154-0.62%
  • render-tokenRender(RNDR)$6.491.31%
  • hedera-hashgraphHedera(HBAR)$0.065939-3.04%
  • cosmosCosmos Hub(ATOM)$6.031.74%
  • ArbitrumArbitrum(ARB)$0.711.89%
  • crypto-com-chainCronos(CRO)$0.084440-1.84%
  • filecoinFilecoin(FIL)$3.961.20%
  • blockstackStacks(STX)$1.5311.41%
  • MantleMantle(MNT)$0.695.02%
  • okbOKB(OKB)$36.790.61%
  • makerMaker(MKR)$2,306.631.51%
  • vechainVeChain(VET)$0.0258250.28%
  • injective-protocolInjective(INJ)$20.54-0.30%
  • First Digital USDFirst Digital USD(FDUSD)$1.00-0.03%
  • immutable-xImmutable(IMX)$1.25-1.34%
  • optimismOptimism(OP)$1.664.89%
  • BittensorBittensor(TAO)$260.472.91%
  • SuiSui(SUI)$0.73-1.44%
  • the-graphThe Graph(GRT)$0.1857292.27%
  • BonkBonk(BONK)$0.0000263.40%
  • Rocket Pool ETHRocket Pool ETH(RETH)$3,463.321.46%
  • NotcoinNotcoin(NOT)$0.015896-3.20%
  • dogwifhatdogwifhat(WIF)$1.62-3.51%
  • Mantle Staked EtherMantle Staked Ether(METH)$3,217.261.40%
  • lido-daoLido DAO(LDO)$1.675.99%
  • arweaveArweave(AR)$22.490.58%
  • Bitget TokenBitget Token(BGB)$1.040.87%
  • FLOKIFLOKI(FLOKI)$0.0001493.91%
  • OndoOndo(ONDO)$0.984.71%
  • WhiteBIT CoinWhiteBIT Coin(WBT)$9.570.50%
  • theta-tokenTheta Network(THETA)$1.361.96%
  • CelestiaCelestia(TIA)$6.80-4.62%
  • aaveAave(AAVE)$90.175.14%
  • fantomFantom(FTM)$0.4687043.51%
  • thorchainTHORChain(RUNE)$3.612.50%
  • jasmycoinJasmyCoin(JASMY)$0.0247126.62%
  • BrettBrett(BRETT)$0.118640-0.80%
  • algorandAlgorand(ALGO)$0.138571-0.33%
  • ether.fi Staked ETHether.fi Staked ETH(EETH)$3,090.241.21%
  • Pyth NetworkPyth Network(PYTH)$0.3017931.24%
  • JupiterJupiter(JUP)$0.78-1.73%
  • quant-networkQuant(QNT)$70.02-1.87%
  • elrond-erd-2MultiversX(EGLD)$37.221.39%
  • SeiSei(SEI)$0.328578-3.83%
  • CoreCore(CORE)$1.103.06%
  • gatechain-tokenGate(GT)$6.941.56%
  • ethereum-name-serviceEthereum Name Service(ENS)$27.130.86%
  • akash-networkAkash Network(AKT)$3.54-0.39%
  • kucoin-sharesKuCoin(KCS)$8.94-1.67%
  • FlareFlare(FLR)$0.019127-1.31%
  • flowFlow(FLOW)$0.551.29%
  • dYdXdYdX(DYDX)$1.321.96%
  • mantra-daoMANTRA(OM)$0.960.88%
  • Kelp DAO Restaked ETHKelp DAO Restaked ETH(RSETH)$3,139.661.41%
  • axie-infinityAxie Infinity(AXS)$5.361.33%
  • galaGALA(GALA)$0.021604-0.81%
  • eosEOS(EOS)$0.520.36%
  • Tokenize XchangeTokenize Xchange(TKX)$9.640.57%
  • StarknetStarknet(STRK)$0.59-0.68%
  • bittorrentBitTorrent(BTT)$0.0000011.27%
  • msolMarinade Staked SOL(MSOL)$169.76-0.40%
  • BeamBeam(BEAM)$0.0148692.24%
  • FasttokenFasttoken(FTN)$2.340.62%
  • bitcoin-cash-svBitcoin SV(BSV)$38.061.96%
  • usddUSDD(USDD)$1.000.40%
  • tezosTezos(XTZ)$0.74-0.96%