Page Not Found
Page not found. Your pixels are in another canvas.
A list of all the posts and pages found on the site. For you robots out there is an XML version available for digesting as well.
Page not found. Your pixels are in another canvas.
About me
This is a page not in th emain menu
Published:
We develop the next generation multi-modal query optimizer with the latest AI techniques. In our recent paper LaPuda: LLM-Enabled Policy-Based Query Optimizer for Multi-Modal Data, a large language model (LLM) based query optimizer is designed for efficient multi-modal, semantic query processing.

Combining powerful LLM and traditional database optimization techniques, the new optimizer achieves significant performance to optimize the multi-modal query plans (with both structured and semantic information, covering multiple data modalities), resulting in 3x query execution speedup for complex multi-modal queries. The design scheme also proves the effectiveness of using LLM in modern query optimizers, especially for complex semantic queries. This project is an important step towards practical design of multi-modal data systems in the LLM era.
Published:
Notes on using Milvus v2.1.x
Published:
The only portable way of using templates at the moment is to implement them in header files by using inline functions.
Published:
Notes on C++ (1).
Published:
Basic notes for learning Elixir.
Published:
My solutions to some Leetcode problems ranging in #201-250.
Published:
ASM is an important tool for manipulating java bytecode. Here is a summary of typical problems when I was implementing a compiler using Java and ASM.
Short description of portfolio item number 1
Short description of portfolio item number 2 
Yifan Wang. "A Survey on Efficient Processing of Similarity Queries over Neural Embeddings." arXiv preprint arXiv:2204.07922 (2022). PDF
Yifan Wang and Daisy Zhe Wang. 2022. Extensible Database Simulator for Fast Prototyping In-Database Algorithms. In Proceedings of the 31st ACM International Conference on Information & Knowledge Management (CIKM '22). Association for Computing Machinery, New York, NY, USA, 5029–5033. https://doi.org/10.1145/3511808.3557205 PDF
Yifan Wang, and Daisy Zhe Wang. "Learned Accelerator Framework for Angular-Distance-Based High-Dimensional DBSCAN". In Proceedings of the 26th International Conference on Extending Database Technology (EDBT), 28th March-31st March, 2023. PDF
Yifan Wang, Haodi Ma, and Daisy Zhe Wang. "LIDER: An Efficient High-dimensional Learned Index for Large-scale Dense Passage Retrieval". PVLDB, 16(2): 154 - 166, 2022. doi:10.14778/3565816.3565819 PDF
Rong Zhu, Lianggui Weng, Wenqing Wei, Di Wu, Jiazhen Peng, Yifan Wang, Bolin Ding, Defu Lian, Bolong Zheng, Jingren Zhou. "PilotScope: Steering Databases with Machine Learning Drivers." VLDB 2024 PDF
Yifan Wang, Vyom Pathak, and Daisy Zhe Wang. "Xling: A Learned Filter Framework for Accelerating High-Dimensional Approximate Similarity Join." arXiv preprint arXiv:2402.13397 (2024). PDF
Yifan Wang, Haodi Ma, and Daisy Zhe Wang. "LaPuda: LLM-Enabled Policy-Based Query Optimizer for Multi-Modal Data". Advances in Knowledge Discovery and Data Mining. PAKDD 2025.
Tinghui Zhang, Yifan Wang, Daisy Zhe Wang. "SCOPE: A Generative Approach for LLM Prompt Compression". arXiv preprint arXiv:2508.15813
Zhuocheng Gan, Yifan Wang. "Efficient Filtered-ANN via Learning-based Query Planning". arXiv preprint arXiv:2602.17914 PDF
Ruyu Li, Tinghui Zhang, Haodi Ma, Daisy Zhe Wang, Yifan Wang. "Beyond Single-Modal Analytics: A Framework for Integrating Heterogeneous LLM-Based Query Systems for Multi-Modal Data". arXiv preprint arXiv:2602.01701 PDF
Undergraduate Course, University of Hawaii at Manoa, 2024
Very large database systems, data integration, data warehousing, designing big data systems, parallel query processing, distributed transactions.
Graduate Course, University of Hawaii at Manoa, 2025
Exploration of advanced database and information retrieval research and introduction to academic research, including paper reviewing, presentation, method designing, evaluation conducting and paper writing.
Undergraduate Course, University of Hawaii at Manoa, 2025
Concepts, tools, and techniques for analyzing and mining massive data sets. Data cleaning and pre-processing. Data analysis and mining techniques. Big Data platforms. Big Data visualization.
Undergraduate Course, University of Hawaii at Manoa, 2026
Introduction to relational database systems, data modeling, query processing, transaction processing, storage, and indexing.
Graduate Course, University of Hawaii at Manoa, 2026
Exploration of advanced database and information retrieval research and introduction to academic research, including paper reviewing, presentation, method designing, evaluation conducting and paper writing.