Researcher/Professor
  • Shimin Chen

  • E-mail: chensm@ict.ac.cn

Biography:

I received my Ph.D. in Computer Science from Carnegie Mellon University in 2005, and my B.E. and M.E. from Tsinghua University in 1997 and 1999, respectively. I worked as a researcher, senior researcher, and research manager at Intel Labs , Carnegie Mellon University and HP Labs . In 2013, I joined ICT CAS and started the database systems research group in ICT. My research interests are in database management systems, big data processing, and computer architecture. I have worked on (i) exploiting NVM, SSDs, and multicores in data intensive systems; (ii) exploiting multicore processors to improve software correctness; and (iii) exploiting database and cloud techniques to provide general-purpose support for big data processing. I teach the undergrad database system course in the fall semester, and the graduate big data system course in the spring semester in University of Chinese Academy of Sciences (UCAS) .

Research area:

Database management systems; Big data processing; Computer architecture.

Selected papers:

  • Zhuoxuan Liu, Shimin Chen. "Pea Hash: A Performant Extendible Adaptive Hashing Index". (to appear) SIGMOD 2023. (SIGMOD'23).
  • Zichun Huang, Shimin Chen. "Density-optimized Intersection-free Mapping and Matrix Multiplication for Join-Project Operations". (pdf, code). PVLDB 15(10): 2244-2256, 2022 (PVLDB'22).
  • Gang Liu, Leying Chen, Shimin Chen. "Zen+: a robust NUMA-aware OLTP engine optimized for non-volatile main memory" (pdf). VLDB Journal, 2022 (https://doi.org/10.1007/s00778-022-00737-1) (VLDBJ'22).
  • Leon Lee, Siphrey Xie, Yunus Ma, Shimin Chen. "Index Checkpoints for Instant Recovery in In-Memory Database Systems" (pdf). PVLDB 15(8): 1671-1683, 2022 (PVLDB'22).
  • Jiacheng Wu, Yong Zhang, Shimin Chen, Yu Chen, Jin Wang, Chunxiao Xing. "Updatable Learned Index with Precise Positions" (pdf). PVLDB 14(8): 1276-1288, 2021 (PVLDB'21).
  • Gang Liu, Leying Chen, Shimin Chen. "Zen: a High-Throughput Log-Free OLTP Engine for Non-Volatile Main Memory" (pdf, code). PVLDB, 14(5): 835 - 848, 2021. (PVLDB'21).
  • Shaonan Ma, Kang Chen, Shimin Chen, Mengxing Liu, Jianglang Zhu, Hongbo Kang, and Yongwei Wu. "ROART: Range-query Optimized Persistent ART" (pdf). 19th USENIX Conference on File and Storage Technologies (FAST'21).
  • Mengsu Ding, Shimin Chen, Nantia Makrynioti, Stefan Manegold. "Progressive Join Algorithms Considering User Preference" (pdf,talk). 11th Conference on Innovative Data Systems Research (CIDR'21).
  • Songjie Niu, Shimin Chen. "TransGPerf: Exploiting Transfer Learning for Modeling Distributed Graph Computation Performance" (pdf). Journal of Computer Science and Technology, 36(4): 778-791, 2021 (JCST'21).
  • Songjie Niu, Shimin Chen: TransGPerf: Exploiting Transfer Learning for Modeling Distributed Graph Computation Performance.
  • Jihang Liu, Shimin Chen, Lujun Wang. "LB+-Trees: Optimizing Persistent Index Performance on 3DXPoint Memory" (pdf,talk,code). PVLDB 13(7): 1078-1090. (PVLDB'20).
  • Jihang Liu, Shimin Chen. "Initial Experience with 3D XPoint Main Memory" (pdf). HardBD & Active workshop, located at ICDE'19 (HardBD & Active'19). Journal version: to appear in DAPD SI on Self-Managing and Hardware-Optimized Database Systems, currently available online.
  • Mengsu Ding, Shimin Chen. "Efficient Partitioning and Query Processing of Spatio-Temporal Graphs with Trillion Edges" (pdf). In proceedings of the 35th International Conference on Data Engineering, 2019 (ICDE'19).
  • Zhiyi Wang, Dongyan Zhou, Shimin Chen. "STEED: An Analytical Database System for TrEE-structured Data" (pdf). In PVLDB Volume 10 Issue 12, 2017 (PVLDB'17 demo).
  • Zhiyi Wang, Shimin Chen. "Exploiting Common Patterns for Tree-Structured Data" (pdf). In proceedings of the 2017 SIGMOD International Conference on Management of Data (SIGMOD'17).
  • Shimin Chen. "Tree-Structured Big Data Management System" (pdf). Communications of CCF, Volume 13, Number 8, pp.22-26, 2017 (CCCF'17).
  • Mengsu Ding, Shimin Chen. "Helius: A Lightweight Big Data Processing System" (pdf). Journal of Computer Application, Volume 37, Number 2, pp.305-310, 2017 (JCA'17).
  • Yanfeng Zhang, Shimin Chen, Ge Yu. "Efficient Distributed Density Peaks for Clustering Large Data Sets in MapReduce" (pdf). IEEE Transactions on Knowledge and Data Engineering, Volume 28, Number 12, pp.3218-3230, Dec 2016 (TKDE'16).
  • Dongxiao Song, Shimin Chen. "Exploiting SIMD for Complex Numerical Predicates" (pdf). International Workshop on Big Data Management on Emerging Hardware, co-located at ICDE'16 (HardBD'16).
  • Yanfeng Zhang, Shimin Chen, Qiang Wang, Ge Yu. "i2MapReduce: Incremental MapReduce for Mining Evolving Big Data" (pdf). IEEE Transactions on Knowledge and Data Engineering, Volume 27, Number 7, pp.1906-1919, July 2015 (TKDE'15).
  • Shimin Chen, Qin Jin. "Persistent B+-Trees in Non-Volatile Main Memory" (pdf). VLDB 2015 (PVLDB Volume 8 Issue 7), March 2015. (VLDB'15).
  • Manos Athanassoulis, Shimin Chen, Anastasia Ailamaki, Phillip B. Gibbons, Radu Stoica. "Online Updates on Data Warehouses via Judicious Use of Solid-State Storage" (pdf). ACM Transaction on Database Systems, Volume 40, Number 1, March 2015 (TODS'15).
  • Songjie Niu, Shimin Chen. "Optimizing CPU Cache Performance for Pregel-Like Graph Computation" (pdf). International Workshop on Big Data Management on Emerging Hardware, co-located at ICDE'15 (HardBD'15).
  • Shimin Chen. "Big Data Analysis and Data Velocity" (pdf). Journal of Computer Research and Development (in Chinese), Volume 52, 2015 (JCRD'15).
  • Zhao Cao, Shimin Chen, Dongzhe Ma, Jianhua Feng, Min Wang. "Efficient and Flexible Index Access in MapReduce" (pdf). 17th International Conference on Extending Database Technology (EDBT'14).
  • Zhao Cao, Shimin Chen, Feifei Li, Min Wang, X. Sean Wang. "LogKV: Exploiting Key-Value Stores for Event Log Processing" (pdf). 6th biennial Conference on Innovative Data Systems Research (CIDR'13).
  • Yanfeng Zhang, Shimin Chen. "i2MapReduce: Incremental Iterative MapReduce" (pdf). 2nd International Workshop on Cloud Intelligence (colocated with VLDB 2013) (Cloud-I'13).
  • Peng Jiang, Huiman Hou, Lijiang Chen, Shimin Chen, Conglei Yao, Chengkai Li, Min Wang. "Wiki3C: Exploiting Wikipedia for Context-aware Concept Categorization" (pdf). 6th ACM International Conference on Web Search and Data Mining (WSDM'13).
  • Lijiang Chen, Yibing Zhao, Shimin Chen, Hui Fang, Chengkai Li, Min Wang. "iPLUG: Personalized List Recommendation in Twitter" (pdf). 14th International Conference on Web Information Systems Engineering (WISE'13).
  • Michelle L. Goodstein, Shimin Chen, Phillip B. Gibbons, Michael A. Kozuch, Todd C. Mowry. "Chrysalis Analysis: Incorporating Synchronization Arcs in Dataflow-Analysis-based Parallel Monitoring" (pdf). 21st International Conference on Parallel Architectures and Compilation Techniques (PACT'12).
  • Sean M. Snyder, Shimin Chen, Panos K. Chrysanthis, Alexandros Labrinidis. "QMD: Exploiting Flash for Energy Efficient Disk Arrays" (pdf). Seventh International Workshop on Data Management on New Hardware (DaMoN'11).
  • Manos Athanassoulis, Shimin Chen, Anastasia Ailamaki, Phillip B. Gibbons, Radu Stoica. "MaSM: Efficient Online Updates in Data Warehouses" (pdf). In proceedings of the 2011 SIGMOD International Conference on Management of Data (SIGMOD'11).
  • Shimin Chen, Phillip B. Gibbons, Michael Kozuch, Todd C. Mowry. "Log-Based Architectures: Using Multicore to Help Software Behave Correctly" (pdf). ACM SIGOPS Operating Systems Review, Volume 45 Issue 1, January 2011 (OS Review'11).
  • Shimin Chen, Phillip B. Gibbons, Suman Nath. "Rethinking Database Algorithms for Phase Change Memory" (pdf). In proceedings of the 5th Biennial Conference on Innovative Data Systems Research (CIDR'11).
  • Manos Athanassoulis, Anastasia Ailamaki, Shimin Chen, Phillip B. Gibbons, Radu Stoica. "Flash in a DBMS: Where and How?" (pdf). IEEE Data Engineering Bulletin, Volume 33, Number 4, December 2010 (DEBull'10).
  • Shimin Chen, Anastasia Ailamaki, Manos Athanassoulis, Phillip B. Gibbons, Ryan Johnson, Ippokratis Pandis, Radu Stoica. "TPC-E vs. TPC-C: Characterizing the New TPC-E Benchmark via an I/O Comparison Study" (pdf). SIGMOD Record, Volume 39, Number 3, September 2010. (SIGMOD Record'10).
  • Shimin Chen, Phillip B. Gibbons, Suman Nath. "PR-Join: A Non-Blocking Join Achieving Higher Early Result Rate with Statistical Guarantees."(pdf) In proceedings of the 2010 SIGMOD International Conference on Management of Data (SIGMOD'10).
  • Olatunji Ruwase, Shimin Chen, Phillip B. Gibbons, Michael Kozuch, Todd C. Mowry. "Decoupled Lifeguards: Enabling Path Optimizations for Dynamic Correctness Checking Tools." (pdf) In proceedings of ACM SIGPLAN 2010 Conference on Programming Language Design and Implementation (PLDI'10).
  • Michelle Goodstein, Evangelos Vlachos, Shimin Chen, Phillip Gibbons, Michael Kozuch, Todd Mowry. "Butterfly Analysis: Adapting Dataflow Analysis to Dynamic Parallel Monitorin"(pdf). In proceedings of 15th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS'10).
  • Evangelos Vlachos, Michelle Goodstein, Michael Kozuch, Shimin Chen, Babak Falsafi, Phillip B. Gibbons, Todd C. Mowry. "ParaLog: Enabling and Accelerating Online Parallel Monitoring of Multithreaded Applications "(pdf). In proceedings of 15th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS'10).
  • Shimin Chen. "FlashLogging: Exploiting Flash Devices for Synchronous Logging Performance"(pdf). In proceedings of the 2009 SIGMOD International Conference on Management of Data (SIGMOD'09).
  • Shimin Chen, Michael Kozuch, Theodoros Strigkos, Babak Falsafi, Phillip B. Gibbons, Todd C. Mowry, Vijaya Ramachandran, Olatunji Ruwase, Michael Ryan, Evangelos Vlachos. "Flexible Hardware Acceleration for Instruction-Grain Lifeguards"(pdf on ieee site). IEEE Micro, Jan/Feb 2009 Special Issue: Micro's Top Picks in Computer Architecture Conferences (IEEE Micro Top Picks).
  • Olatunji Ruwase, Phillip B. Gibbons, Todd C. Mowry, Vijaya Ramachandran, Shimin Chen, Michael Kozuch, Michael Ryan. "Parallelizing Dynamic Information Flow Tracking Lifeguards"(pdf). In proceedings of the 20th ACM Symposium on Parallelism in Algorithms and Architectures (SPAA'08).
  • Shimin Chen, Michael Kozuch, Theodoros Strigkos, Babak Falsafi, Phillip B. Gibbons, Todd C. Mowry, Vijaya Ramachandran, Olatunji Ruwase, Michael Ryan, Evangelos Vlachos. "Flexible Hardware Acceleration for Instruction-Grain Program Monitoring"(pdf). In proceedings of the 35th International Symposium on Computer Architecture (ISCA'08) (accepted in IEEE Micro Special Issue "Top Picks from Computer Architecture Conferences").
  • Shimin Chen, Steven W. Schlosser. "Map-Reduce Meets Wider Varieties of Applications." (pdf). Intel Research Pittsburgh Tech Report, IRP-TR-08-05, May, 2008.
  • Guy E. Blelloch, Rezaul A. Chowdhury, Phillip B. Gibbons, Vijaya Ramachandran, Shimin Chen, Michael Kozuch. "Provably Good Multicore Cache Performance for Divide-and-Conquer Algorithms"(pdf). In proceedings of the 2008 ACM-SIAM Symposium on Discrete Algorithms (SODA'08).
  • Shimin Chen, Anastasia Ailamaki, Phillip B. Gibbons, and Todd C. Mowry. "Improving Hash Join Performance through Prefetching"(pdf). ACM Transactions on Database Systems, Volume 32, Number 3, August 2007 (TODS'07). (This is the journal version of the ICDE 2004 paper. It includes a detailed performance study on an Itanium 2 machine, detailed analyses of the prefetching techniques, and discussion on practical issues of implementing the prefetching techniques in production DBMSs.)
  • Shimin Chen, Phillip B. Gibbons, Michael Kozuch, Vasileios Liaskovitis, Anastasia Ailamaki, Guy E. Blelloch, Babak Falsafi, Limor Fix, Nikos Hardavellas, Todd C. Mowry, Chris Wilkerson. "Scheduling Threads for Constructive Cache Sharing on CMPs" (ps, pdf). In proceedings of the 19th ACM Symposium on Parallelism in Algorithms and Architectures (SPAA'07).
  • Shimin Chen, Babak Falsafi, Phillip B. Gibbons, Michael Kozuch, Todd C. Mowry, Radu Teodorescu, Anastasia Ailamaki, Limor Fix, Gregory R. Ganger, Bin Lin, Steven W. Schlosser. "Log-Based Architectures for General-Purpose Monitoring of Deployed Code" (pdf). In proceedings of Workshop on Architectural and System Support for Improving Software Dependability (ASID) held with ASPLOS, 2006.
  • Vasileios Liaskovitis, Shimin Chen, Phillip B. Gibbons, Anastasia Ailamaki, Guy E. Blelloch, Babak Falsafi, Limor Fix, Michael Kozuch, Todd C. Mowry, Chris Wilkerson. "Parallel Depth First vs. Work Stealing Schedulers on CMP Architectures" (pdf). In proceedings of the 18th ACM Symposium on Parallelism in Algorithms and Architectures (SPAA'06), brief announcements track.
  • Shimin Chen. "Redesigning Database Systems in Light of CPU Cache Prefetching" (ps, pdf). Ph.D. Thesis, Carnegie Mellon University, Computer Science Department, December 2005.
  • Shimin Chen, Anastasia Ailamaki, Phillip B. Gibbons, and Todd C. Mowry. "Inspector Joins" (ps, pdf). In proceedings of the 31st VLDB Conference, 2005 (VLDB'05).
  • Shimin Chen, Phillip B. Gibbons, and Suman Nath. "Database-Centric Programming for Wide-Area Sensor Systems" (ps, pdf). In proceedings of the International Conference on Distributed Computing in Sensor Systems, 2005 (DCOSS'05).
  • Shimin Chen, Anastasia Ailamaki, Phillip B. Gibbons, and Todd C. Mowry. "Improving Hash Join Performance through Prefetching"(ps, pdf). In proceedings of the 20th International Conference on Data Engineering, 2004 (ICDE'04). The full paper was published as CMU Technical Report CMU-CS-03-157, October 2003. (Best Paper Award)
  • Roger Barga, Shimin Chen, and David Lomet. "Improving Logging and Recovery Performance in Phoenix/App" (pdf). In proceedings of the 20th International Conference on Data Engineering, 2004 (ICDE'04).
  • Shimin Chen, Phillip B. Gibbons, Todd C. Mowry, and Gary Valentin. "Fractal Prefetching B+-Trees: Optimizing Both Cache and Disk Performance"(ps,pdf). In proceedings of the 2002 SIGMOD International Conference on Management of Data (SIGMOD'02). The full paper was published as CMU Technical Report CMU-CS-02-115, March 2002.
  • Shimin Chen, Phillip B. Gibbons, and Todd C. Mowry. "Improving Index Performance through Prefetching"(ps, pdf). In proceedings of the 2001 SIGMOD International Conference on Management of Data (SIGMOD'01). The full paper was published as CMU Technical Report CMU-CS-00-177, December 2000. (Runner-up for Best Paper Award)