- 'These developments reveal a diversity of perspectives, interests, and expectations regarding Big Data' (Ekbia, et al., 2014:1524)
-
Authors identify recurring themes and problems
- disciplined-specific
- issue-oriented
- methodologically-informed
- Authors' synthesization frames these all as 'dilemmas'
-
Discussion of phenomenon vs. appearance, and the gap between them
-
19th Century – save the phenomenon, produce the appearance, via causal mechanisms
-
20th Century – save the appearance, let go of causal explanations
- 21st Century – predict the appearance, via Big Data
- Where phenomenon becomes appearance – and raises the question of whether data, itself, is enough?
- 'Big Data methodologies ... collapse the distinction between phenomena and appearances altogether, presenting us with structuralism run amok' (2014:1530)
-
Big Data processing
- mechanical
-
human judgment
- human labor and expertise
- 'human-machine symbiosis' (2014:1535)
-
quantitative vs. qualitative
- 'traditional tension ... is rendered obsolete' with Big Data (2014:1531)
- '[d]ata need to be imagined and enunciated to exist as data, and such imagination happens in the particulars of disciplinary orientations, methodologies, and evolving practices' (2014:1531)
-
issues encountered with Big Data analysis
- 'cleaning'
- sharing/transparency
-
privacy
- reidentification
-
'what counts and what doesn't'
- sampling
- selection bias
-
relates to scale and scope of the data being analyzed
-
historical dillemas remain the same
- relevance
- validity
- generalizability
- replicability
-
open data or not
- intellectual property
- ethics
- creator contribution or exploitation
- creator compliance or resistance
- 'Big data is dark data ... [need to] devise techniques that bring human judgment and technological prowess to bear in a meaningfully balanced manner' (2014:1539)
-
political economy specific issues
- wealth creation: winners vs. losers
- 'filter bubbles' and recommender systems (2014:1538)
- Big Data doesn't relieve a researcher of responsibility for his/her work
-
Data visualization
- accuracy vs. aesthetic appeal
- may be weighted toward aesthetic appeal
- are tradeoffs noted to end users?
- see figure 2 (2014:1533)
- '[t]he complexities associated with understanding Big Data and the visual capabilities of the digital medium have pushed this kind of practice to a whole new level, heavily tilting the balance between truth and beauty toward the latter' (2014:1533)
- Digital divide: haves vs. have nots
- 'Real or near-real-time information processing – that is, delivery of information and analysis results as they occur – is the goal and a defining characteristic of Big Data analytics' (2014:1534)
-
Big Data attributes
- autonomy
- opacity
- generativity
- disparity
- futurity
- 'The duality, futurity, and disparity of Big Data, along with its various conceptualizations among practitioners, make it unlikely for a consensus view to emerge in terms of dealing with the dilemmas introduced here' (2014:1540)
- Will require a 'concerted effort between academics and policy makers, in rich conversation with the public' (2014:1541)
-
Academic interest in Big Data
- Increasing publications on the topic
-
Non-academic interest in Big Data
-
Big Data offers opportunities for
- commerce
- innovation
- social engineering
- Users include business, industry, and government
-
General public interest in Big Data
- Fostered by 'the blurring of boundaries between data produced by humans and data about humans' (Ekbia, et al., 2014:1524, c.f. Shilton, 2012)
-
Definition remains undefined, with perspective dependent upon the definer/purpose
-
product-oriented
-
focus is quantitative and on the size of data
- volume
- velocity
- variety
- value
- veracity
- This perspective centers on the 'attributes of the data themselves' (2014:1526)
-
process-oriented
-
relating to collection, curation, use of data
- storage
- management
- aggregation
- searching
- analysis
- This perspective centers on the 'processing and storing abilities of technologies' (2014:1526)
-
cognition-oriented
-
concerns the way humans relate to data
- human limits when faced with Big Data
- need for mediation, including 'trans-disciplinary work, technological infrastructures, statistical analyses, and visualization techniques' (2014:1527)
- This perspective centers on what is needed for humans to make sense of the data
-
social movement - added by the authors
- socioeconomic, cultural, and political shifts
- diffusion of technological innovation
- complex ecosystem
- strategic partnerships
- This perspective accounts for gaps 'between these visions and the realities on the ground' (2014:1528)