Building High-performing Web Rendering of Large Data Sets
2023 (English)Independent thesis Advanced level (professional degree), 20 credits / 30 HE credits
Student thesis
Abstract [en]
Interactive visualization is an essential tool for data analysis. Cloud-based data analysis software must handle growing data sets without relying on powerful end-user hardware. This thesis explores and tests various methods to speed up primarily time series plots of large data sets on the web for the biotechnology research company Sartorius.
To increase rendering speed, I focused on two main approaches: downsampling and hardware acceleration. To find which sampling algorithms suit Sartorius's needs, I implemented multiple alternatives and compared them quantitatively and qualitatively. The results show that downsampling increases or eliminates data set size limits and that test users favored algorithms maintaining local outliers. With hardware acceleration that substantially increased the amount of simultaneously rendered points for more detailed representations, these methods pave the way for efficient visualization of large data sets on the web.
Place, publisher, year, edition, pages
2023. , p. 32
Keywords [en]
Plot, Trajectory plot, Downsampling, Data set, Interactive visualization, Hardware acceleration, Web rendering, WebGL, D3FC, Apache Arrow
National Category
Other Computer and Information Science Computer Sciences
Identifiers
URN: urn:nbn:se:umu:diva-210056OAI: oai:DiVA.org:umu-210056DiVA, id: diva2:1770736
External cooperation
Sartorius Stedim Data Analytics AB
Subject / course
Examensarbete i teknisk fysik
Educational program
Master of Science Programme in Engineering Physics
Presentation
2023-06-09, NAT.D.300, Johan Bures väg 14, Umeå, 08:00 (English)
Supervisors
Examiners
2023-06-212023-06-192023-06-21Bibliographically approved