médialab Tools

Tools we develop, and tools we use



Hyphe is a web corpus curation tool featuring a research-driven web crawler

I Want Hue

Colors for data scientists. Generate and refine palettes of optimally distinct colors.


SeeAlsology is a simple tool to explore in a quick & dirty way the semantic area related to Wikipedia pages by producing a graph of links in the "See also" sections.


Explore networks and publish narratives. More and more people work with graphs nowadays, but it is not always easy to publish and share the graph interpretation on the web. Manylines is a web tool to solve this issue. It allows the user to display, spatialize and categorize a network, before explaining it and sharing a narrative about it: an interactive story where each “slide” is a particular zoom, pan and filtering of the network, completed by a title and description, with fluid transitions like in Prezi.


Helpers for scientometrics. Convert files, get networks, visualize stuff from Scopus or Web of Science.

Table 2 Net

Extract a network from a table. Set a column for nodes and a column for edges. It deals with multiple items per cell.


Easily create rich qualitative and quantitative maps from CSV files against different geographical projections.


A tweet curation tool for humanities. Takes output from Gazouilloire or TCAT and lets one easily browse all tweets to select them qualitatively.

Hyphe Browser

A full browser connected to a Hyphe server to constitute, navigate and categorize the webpages of your corpus in situ the Navicrawler way.

experimental Source > Try it !

Google Bookmarklets

Easily change a Google Search into its 100 results version and extract a CSV of results across multiple pages within your browser using simple bookmarklets.

Advanced tools for developers


Sigma is a JavaScript library dedicated to graph drawing. It makes easy to publish networks on Web pages, and allows developers to integrate network exploration in rich Web applications.


artoo.js - the client-side scraping companion.


Take advantage of artoo.js scraping features in a node.js serverside crawling framework optionally using PhantomJS.

experimental Source > Try it !

CSV Rinse Repeat

You have a CSV and you need to transform it into a better CSV, which includes filtering, clustering, parsing, merging, (...) and matching regular expressions. Why not do it in JS? You're welcome.


Gazouilloire is a simple backend tool to collect within a Mongo database all tweets matching a list of keywords, optionnally with a planning of periods of collect for each keyword. For better completion, it collects tweets from both Twitter's Search API and Stream API.


A python library to read or write gexf format, the xml file format to store graph.

Other uncontinued projects


Visualize densities on spatialized networks. Get a global heatmap or a heatmap of the neighborhood of a given node.

ANTA, actor-network text analyzer

ANTA or Actor Network Text Analyzer is a piece of software developed by the Sciences Po médialab to analyses medium-size text corpora, by extracting the expressions they contained in a set of texts and drawing a network of the occurrence of such expressions in the texts.


A simple interface to extract texts from (almost) any url.


Publish a simple website from a _public_ google drive folder without coding in HTML

issue 2 navicrawler

A python library to transform a issuecrawler xml file into a navicrawler wxsf (xml) file format.

Tools from others we use & like

Gephi is an interactive visualization and exploration platform for all kinds of networks and complex systems, dynamic and hierarchical graphs. Runs on Windows, Linux and Mac OS X. Gephi is open-source and free.

by Mathieu Jacomy, Matthieu Bastian, Sébastien Heymann Source > Go !

Issuecrawler, the Lippmanian Device, TCAT for twitter collection, NetVizz for Facebook, Google/Amazon/Wikipedia/... scrapers: plenty of useful digital tools for sociologists built by the Digital Methods Initiative at the university of Amsterdam

by the Digital Methods Initiative (Erik Borra, Bernhard Rieder, ...) > Go !


Developed by the Density Design Research Lab at Politecnico di Milano, Raw is a web interface to help easily create custom vector-based visualizations on top of the amazing d3.js library by Mike Bostock.

by Density Design (Giorgio Caviglia, Michele Mauri) Source > Go !

Palladio is a product data-driven tool to build networks in History and analyze relationships across time though a graphical interfaces based on humanistic inquiry.

by Stanford HDLab (Nicole Coleman, Ethan Jewett, Giorgio Caviglia) Source Documentation > Go !

The CorTexT initiative of IFRIS proposes tools to explore and analyse semantics dynamics in corpus of texts.

by IFRIS (Jean-Philippe Cointet, Philippe Breucker, Guillaume Orsal) Source Documentation > Go !

uMap let you create maps with OpenStreetMap layers in a minute and embed them in your site.

by Yohan Boniface Source Documentation > Go !

Manet is an easily installable REST API server which allows capturing screenshots of websites using various parameters.

by Vladislav Bauer Source > Go !

Open Refine from Google is always useful when dealing with dirty data.

by Google Source