How much time do you waste on data chores?

The Data Detektiv works with individuals, companies, and non-profits to provide custom data infrastructure and solutions for integration, migration, analysis, and visualization. Working remotely or on site, we can scale to meet the needs of any project.

Data Rescue in Tokyo

Two weeks ago I attended the 7th Plenary of the Research Data Alliance (RDA) in Tokyo. I enjoy attending RDA Plenaries not only because of the interesting topics but also because of all my wonderful RDA colleagues. I was recently appointed the RDA US Data Share Ambassador, so I work ...

Dataphile

I eat data for breakfast

Keystone Predators and Centrality: Ecosystem as Social Network Part 1

A few weeks ago I announced a project that would train an algorithm to recognize important taxa in an ecosystem using the characteristics of species interactions within that ecosystem. This post documents the first bit of work I’ve ...

11 Aug

Ecosystem as Social Network

Lately, I’ve been thinking about how interactions between organisms in an ecosystem can be represented as a graph, with nodes and edges, similar to a social network. The nodes represent an organism or group of organisms while the edges ...

23 Jun

Trickle Down Attribution

Last week I was in Portland, Oregon attending the annual meeting of Force11, a community interested in the future of research communications. There were many great speakers and panel discussions, but what interested me the most was the unveiling ...

3 May

Data Rescue in Tokyo

Two weeks ago I attended the 7th Plenary of the Research Data Alliance (RDA) in Tokyo. I enjoy attending RDA Plenaries not only because of the interesting topics but also because of all my wonderful RDA colleagues. I was recently appointed the ...

12 Mar

Citizen Science Data Integration

One of the projects I’m working on now is integrating data about North American butterfly observations collected by about a dozen different citizen science butterfly monitoring programs. The people who collect the data are volunteers who are ...

13 Feb

Semantic Linking of Phenotypes and Environments

One of the fundamental goals of biology is understanding the interactions of environment and phenotype, but this is a surprisingly difficult topic to study – not because of the concepts, but because of the data. Observations about ...

17 Dec

Molecules and Metadata

GenBank is a repository for genetic sequences sponsored by the National Institutes of Health. It holds over 100 million different sequences from organisms across the tree of life, from humans to mushrooms to amoebae. Studying genetic sequences ...

9 Oct

How Fair is Big Data?

“Big Data” and machine learning are used in a wide variety of disciplines, from making credit and insurance decisions to driving medical research, but how accurate is this approach? If algorithms are ground-truthed using a biased ...

27 Sep

Finding Dark Data

One of my recent clients was working on optimizing a hydrodynamic, particle-tracking model for predicting the fate and transport of oil droplets in the Gulf of Mexico during an accidental marine spill. To do this, the client needed a database of ...

22 Aug
