personal-website/posts/2016/mar20.html

175 lines
11 KiB
HTML

<!DOCTYPE html>
<html lang="en">
<head>
<meta http-equiv="Content-Type" content="text/html; charset=UTF-8">
<meta charset="utf-8">
<meta http-equiv="X-UA-Compatible" content="IE=edge">
<meta name="viewport" content="width=device-width, initial-scale=1">
<meta name="description" content="">
<meta name="author" content="">
<link rel="icon" href="../../favicon.png">
<title>March 20</title>
<!-- Bootstrap core CSS -->
<link href="../../css/bootstrap.min.css" rel="stylesheet">
<!-- IE10 viewport hack for Surface/desktop Windows 8 bug -->
<link href="../../css/ie10-viewport-bug-workaround.css" rel="stylesheet">
<!-- Custom styles for this template -->
<link href="../../css/vis.css" rel="stylesheet">
</head>
<body>
<div class="container">
<!-- Static navbar -->
<nav class="navbar navbar-inverse">
<div class="container-fluid">
<div class="navbar-header">
<button type="button" class="navbar-toggle collapsed" data-toggle="collapse" data-target="#bs-example-navbar-collapse-2">
<span class="sr-only">Toggle navigation</span>
<span class="icon-bar"></span>
<span class="icon-bar"></span>
<span class="icon-bar"></span>
</button>
<a class="navbar-brand" href="../../index.html">VICKY STEEVES</a>
</div>
<div class="collapse navbar-collapse" id="bs-example-navbar-collapse-2">
<ul class="nav navbar-nav">
<li><a href="../../index.html">Home</a></li>
<li><a href="../../resume.html">Resume</a></li>
<li><a href="../../blog.html">Blog</a></li>
</ul>
<ul class="nav navbar-nav navbar-right">
<!--dropdown-->
<li class="dropdown"><a href="#" class="dropdown-toggle" data-toggle="dropdown" role="button" aria-haspopup="true" aria-expanded="false">Data<span class="caret"></span></a>
<ul class="dropdown-menu">
<li><a href="https://osf.io/7mj2q/" >Open Science Framework</a></li>
<li><a href="https://github.com/VickySteeves" >GitHub</a></li>
</ul>
</li><!--end dropdown-->
<!--dropdown-->
<li class="dropdown"><a href="#" class="dropdown-toggle" data-toggle="dropdown" role="button" aria-haspopup="true" aria-expanded="false">Social Media<span class="caret"></span></a>
<ul class="dropdown-menu">
<li><a href="https://twitter.com/VickySteeves" >Twitter</a></li>
<li><a href="https://www.instagram.com/vickysteeves/" >Instagram</a></li>
<li><a href="https://www.linkedin.com/in/victoriaisteeves" >LinkedIn</a></li>
</ul>
</li><!--end dropdown-->
</ul>
</div>
</div>
</nav>
<div class="blog-header">
<h1 class="blog-title">Data, Science, & Librarians, <br /> Oh My!</h1>
<p class="lead blog-description">My thoughts as I navigate the world of data librarianship.</p>
</div>
<div class="row">
<div class="col-sm-8 blog-main">
<div class="blog-post">
<h2 class="blog-post-title">Getting Use Cases is Hard</h2>
<p class="blog-post-meta">March 20, 2016 by <a href="../../resume.html">Vicky Steeves</a></p>
<p>One of my big tasks since coming into NYU last August was to work on the <a href="https://github.com/ViDA-NYU/reprozip" >ReproZip</a> project. My role is largely outreach and education: I was tasked with teaching ReproZip and general reproducibility principles, gathering use cases in a wider variety of disciplines (when I arrived, the use cases were largely in computer science), and supporting users in general.</p>
<p>ReproZip kind of blew my mind when I arrived; it's an open source software tool that simplifies the process of creating reproducible experiments. Basically it tracks operating system calls and creates a package that contains all the binaries, files, and dependencies required to reproduce the experiment. A reviewer can then extract the experiment on their own machine using ANY operating system (even if it's different from the original one!!) to reproduce the results. As a librarian, I was like "OH MY GOD. THE DIGITAL PRESERVATION GAME JUST GOT UPPED." Anyway, here's basically how ReproZip works -- in 2 steps:</p>
<p><u>Step 1: Packing</u></p>
<img src="../../img/packing.png" height=100% width=100%>
<br/>
<hr>
<p><u>Step 2: Unpacking</u></p>
<img src="../../img/unpacking.png" height=100% width=100%>
<br/><br/>
<p><a href="https://remram.fr/" >Rémi Rampin</a>, the current developer of ReproZip, <a href="http://bigdata.poly.edu/~fchirigati/" >Fernando Chirigati</a>, the former developer, and I created this great GitHub repository called <a href="https://github.com/ViDA-NYU/reprozip-examples" >ReproZip Examples</a>, dedicated to showcasing examples and use cases from different domains using ReproZip. We have everything from digital humanities (a history paper used ReproZip!) to archived websites and client-server architecture, to machine learning. It's awesome -- check it out and try to unpack stuff if you want!</p>
<p>I'm really advocating hard for libraries to start at least investigating using ReproZip for their digital collections -- there is so much unused potential for this it's actually crazy, which brings us around to the title. Getting use cases is hard.</p>
<p>In May, Rémi and I will be at the Data and Software and Preservation for Open Science workshop, <a href="https://daspos.crc.nd.edu/index.php/workshops/container-strategies-for-data-software-preservation-that-promote-open-science" >Container Strategies for Data Software Preservation that Promote Open Science</a>. I'm serving as an external organizer, but the two of us will be doing some extensive work with ReproZip while there.</p>
<img src="../../img/daspos.png" height=100% width=100%>
<p class="caption">Image from the <a href="https://daspos.crc.nd.edu/" >DASPOS</a> website.</p>
<p>The DASPOS project, NSF funded, "represents a collective effort to explore the realization of a viable data, software, and computation preservation architecture for High Energy Physics (HEP)." But at this point, it's grown FAR beyond HEP -- the workshop so far is slotted to have representation from a variety of fields and professions (like libraries!!).</p>
<p>In addition to a talk/demo during the conference proceedings, Rémi and I are leading three breakout sessions that will allow people to try out ReproZip for themselves, using their research if they brought some. I'm hoping that, with the new ReproZip-Examples, we can get some people at the DASPOS workshop to add their own .rpz packages for us to try and reproduce! This would be the best-case scenario, but it depends a lot on the research of the participants.</p>
<p>Anyway. I'm really looking forward to learning more about some other containerizing tools like <a href="https://github.com/crcresearch/daspos-umbrella" >Umbrella</a> and meeting some other folks (hopefully a lot of librarians!!) who are involved in the reproducibility and preservation space. The community doing active tool development in this area seems fairly small, so it'll be great for fostering interoperability having us all in a room.</p>
</div><!-- /.blog-post -->
</div><!-- /.blog-main -->
<!--blog sidebar -->
<div class="col-sm-3 col-sm-offset-1 blog-sidebar">
<div class="sidebar-module sidebar-module-inset alert alert-dismissible alert-danger">
<h4>About</h4>
<p>A blog chronicling my career mostly, with some scattered pictures of my cat.</p>
</div>
<div class="sidebar-module alert alert-dismissible alert-success">
<h4>Archives</h4>
<ol class="list-unstyled">
<li><a href="jul30.html">July 2016</a></li>
<li><a href="jun27.html">June 2016</a></li>
<li><a href="may15.html">May 2016</a></li>
<li><a href="apr20.html">April 2016</a></li>
<li><a href="#">March 2016</a></li>
<li><a href="feb16.html">February 2016</a></li>
<li><a href="jan15.html">January 2016</a></li>
<li><a href="../2015/dec16.html">December 2015</a></li>
<li><a href="../2015/nov20.html">November 2015</a></li>
<li><a href="../2015/oct10.html">October 2015</a></li>
<li><a href="../2015/sep21.html">September 2015</a></li>
<li><a href="../2015/aug14.html">August 2015</a></li>
<li><a href="../2015/jun2.html">June 2015</a></li>
<li><a href="../2015/may1.html">May 2015</a></li>
<li><a href="../2015/apr14.html">April 2015</a></li>
<li><a href="../2015/mar24.html">March 2015</a></li>
<li><a href="../2015/feb12.html">February 2015</a></li>
<li><a href="../2015/jan14.html">January 2015</a></li>
<li><a href="../2014/dec18.html">December 2014</a></li>
<li><a href="../2014/nov10.html">November 2014</a></li>
<li><a href="../2014/oct23.html">October 2014</a></li>
</ol>
</div>
<div class="sidebar-module alert alert-dismissible alert-info">
<h4>Elsewhere</h4>
<a class="twitter-timeline" href="https://twitter.com/VickySteeves" data-widget-id="615910359816384512">Tweets by @VickySteeves</a>
<script>!function(d,s,id){var js,fjs=d.getElementsByTagName(s)[0],p=/^http:/.test(d.location)?'http':'https';if(!d.getElementById(id)){js=d.createElement(s);js.id=id;js.src=p+"://platform.twitter.com/widgets.js";fjs.parentNode.insertBefore(js,fjs);}}(document,"script","twitter-wjs");</script>
</div>
</div>
<!-- /.blog-sidebar -->
</div><!-- /.row -->
</div><!-- /.container -->
<footer class="blog-footer">
<p><a href="mailto:victoriaisteeves@gmail.com">Email Vicky</a> | <a href="#">Back to top</a></p>
<p><a rel="license" href="http://creativecommons.org/licenses/by-nc-sa/4.0/"><img alt="Creative Commons License" style="border-width:0" src="https://i.creativecommons.org/l/by-nc-sa/4.0/88x31.png" /></a><br /><span xmlns:dct="http://purl.org/dc/terms/" property="dct:title">Data, Science, & Librarians, Oh My!</span> by <a xmlns:cc="http://creativecommons.org/ns#" href="http://vickysteeves.com/blog.html" property="cc:attributionName" rel="cc:attributionURL">Vicky Steeves</a> is licensed under a <a rel="license" href="http://creativecommons.org/licenses/by-nc-sa/4.0/">Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License</a></p>
</footer>
<!-- Bootstrap core JavaScript
================================================== -->
<!-- Placed at the end of the document so the pages load faster -->
<script src="https://ajax.googleapis.com/ajax/libs/jquery/1.11.3/jquery.min.js"></script>
<script>window.jQuery || document.write('<script src="../../assets/js/vendor/jquery.min.js"><\/script>')</script>
<script src="../../js/bootstrap.min.js"></script>
<!-- IE10 viewport hack for Surface/desktop Windows 8 bug -->
<script src="../../js/ie10-viewport-bug-workaround.js"></script>
</body>
</html>