Page Not Found
Page not found. Your pixels are in another canvas.
A list of all the posts and pages found on the site. For you robots out there is an XML version available for digesting as well.
Page not found. Your pixels are in another canvas.
About me
This is a page not in th emain menu
Published:
We develop the next generation multi-modal query optimizer with the latest AI techniques. In our recent paper LaPuda: LLM-Enabled Policy-Based Query Optimizer for Multi-Modal Data, a large language model (LLM) based query optimizer is designed for efficient multi-modal, semantic query processing.
Combining powerful LLM and traditional database optimization techniques, the new optimizer achieves significant performance to optimize the multi-modal query plans (with both structured and semantic information, covering multiple data modalities), resulting in 3x query execution speedup for complex multi-modal queries. The design scheme also proves the effectiveness of using LLM in modern query optimizers, especially for complex semantic queries. This project is an important step towards practical design of multi-modal data systems in the LLM era.
Published:
Notes on using Milvus v2.1.x
Published:
The only portable way of using templates at the moment is to implement them in header files by using inline functions.
Published:
Notes on C++ (1).
Published:
Basic notes for learning Elixir.
Published:
My solutions to some Leetcode problems ranging in #201-250.
Published:
ASM is an important tool for manipulating java bytecode. Here is a summary of typical problems when I was implementing a compiler using Java and ASM.
Short description of portfolio item number 1
Short description of portfolio item number 2
Yifan Wang. "A Survey on Efficient Processing of Similarity Queries over Neural Embeddings." arXiv preprint arXiv:2204.07922 (2022). PDF
Yifan Wang and Daisy Zhe Wang. 2022. Extensible Database Simulator for Fast Prototyping In-Database Algorithms. In Proceedings of the 31st ACM International Conference on Information & Knowledge Management (CIKM '22). Association for Computing Machinery, New York, NY, USA, 5029–5033. https://doi.org/10.1145/3511808.3557205 PDF
Yifan Wang, and Daisy Zhe Wang. "Learned Accelerator Framework for Angular-Distance-Based High-Dimensional DBSCAN". In Proceedings of the 26th International Conference on Extending Database Technology (EDBT), 28th March-31st March, 2023. PDF
Yifan Wang, Haodi Ma, and Daisy Zhe Wang. "LIDER: An Efficient High-dimensional Learned Index for Large-scale Dense Passage Retrieval". PVLDB, 16(2): 154 - 166, 2022. doi:10.14778/3565816.3565819 PDF
Rong Zhu, Lianggui Weng, Wenqing Wei, Di Wu, Jiazhen Peng, Yifan Wang, Bolin Ding, Defu Lian, Bolong Zheng, Jingren Zhou. "PilotScope: Steering Databases with Machine Learning Drivers." VLDB 2024 PDF
Yifan Wang, Vyom Pathak, and Daisy Zhe Wang. "Xling: A Learned Filter Framework for Accelerating High-Dimensional Approximate Similarity Join." arXiv preprint arXiv:2402.13397 (2024). PDF
Yifan Wang, Haodi Ma, and Daisy Zhe Wang. "LaPuda: LLM-Enabled Policy-Based Query Optimizer for Multi-Modal Data". Advances in Knowledge Discovery and Data Mining. PAKDD 2025.
Tinghui Zhang, Yifan Wang, Daisy Zhe Wang. "SCOPE: A Generative Approach for LLM Prompt Compression". arXiv preprint arXiv:2508.15813
Undergraduate/Graduate course, University of Florida, 2019
Giving lectures for lab work of the course, grading and question answering.
Undergraduate course, University of Florida, 2022
Instructing lab work of the course, grading and question answering.
Graduate course, University of Florida, 2023
Assisting in classes, grading assignments/exams/projects, holding office hours and question answering.
Graduate course, University of Florida, 2024
Assisting in classes, making up exams, grading assignments/exams/projects, holding office hours and question answering.