Show Notes(01:58) Dzejla described her undergraduate experience studying Computer Science at the Sarajevo School of Science and Technology back in the mid-2000s.(07:59) Dzejla recapped her overall experience getting a Ph.D. in Computer Science at Stony Brook University.(14:38) Dzejla unpacked the key research problem in her Ph.D. thesis titled “Upper and Lower Bounds on Sorting and Searching in External Memory.”(19:13) Dzejla went over the details of her paper “Don’t Thrash: How to Cache Your Hash on Flash,” — which describes the Cascade Filter, an approximate-membership-query data structure that scales beyond main memory, that is an alternative to the well-known Bloom-filter data structure.(24:41) Dzejla elaborated on her work “The batched predecessor problem in external memory,” — which studies the lower bounds in three external memory models: the I/O comparison model, the I/O pointer-machine model, and the index-ability model.(29:56) Dzejla shared her learnings from being a Teaching Assistant for the Introduction to Algorithms course at Stony Brook (both at the undergraduate and graduate level).(35:08) Dzejla went over her summer internships at Microsoft’s Server and Tools Division during her Ph.D.(41:06) Dzejla reasoned about her decision to return to Sarajevo School of Science and Technology as an Assistant Professor of Computer Science.(47:22) Dzejla dissected the essential concepts and methods covered in her Data Structures, Introductory Algorithms, Advanced Algorithms, and Algorithms for Big Data courses taught at SSIT.(48:42) Dzejla provided a brief overview of the Computer Science/Software Engineering department at the International University of Sarajevo (where she has been a professor since 2017.(50:57) Dzejla briefly talked about the courses that she taught at IUS, including Intro to Programming, Human-Computer Interaction, and Algorithms/Data Structures.(52:49) Dzejla shared the challenges of writing Algorithms and Data Structures for Massive Datasets, which introduces data processing and analytics techniques specifically designed for large distributed datasets.(56:14) Dzejla explained concepts in Part 1 of the book — including Hash Tables, Approximate Membership, Bloom Filters, Frequency/Cardinality Estimation, Count-Min Sketch, and Hyperloglog.(58:38) Dzejla provided a brief overview of techniques to handle streaming data in Part 2 of the book.(01:00:14) Dzejla mentioned the data structures for large databases and external-memory algorithms in Part 3 of the book.(01:02:15) Dzejla shared her thoughts about the tech community in Sarajevo.(01:04:16) Closing segment.Dzejla’s Contact InfoLinkedInTwitterGoogle ScholarMentioned Content

Papers

“Upper and Lower Bounds on Sorting and Searching in External Memory” (Dzejla’s Ph.D. Thesis, 2014)“Don’t Thrash: How to Cache Your Hash on Flash” (2012)“The batched predecessor problem in external memory” (2014)

People

Erik Demaine (Computer Science Professor at MIT)Michael Bender (Computer Science Professor at Stony Brook, Dzejla’s Ph.D. Advisor)Joseph Mitchell (Computational Geometry Professor at Stony Brook)Steven Skiena (Computer Science Professor at Stony Brook)Jeff Erickson (Computer Science Professor at UIUC)

Books

“Algorithms and Data Structures for Massive Datasets” (by Dzejla Medjedovic, Emin Tahirovic, and Ines Dedovic)“The Algorithm Design Manual” (by Steven Skiena)

Here is a permanent 40% discount code (good for all Manning products in all formats) for Datacast listeners: poddcast19. Link at http://mng.bz/4MAR.

Here is one free eBook code good for a copy of Algorithms and Data Structures for Massive Datasets for a lucky listener: algdcsr-7135. Link at http://mng.bz/Q2y6

Twitter Mentions