Hi, we are Dept - an international digital agency with over 1000 experienced thinkers & makers. One agency uniting creativity, technology, and data. Helping reinvent & accelerate your digital reality by creating experiences that people want and businesses need. www.deptagency.com
Appeared in VentureBeat
In an era of big data and AI, what role is there for decentralized internet and data storage concepts? The tensions and contradictions between these parallel developments were unpacked at SXSW in a compelling talk, Designing For the Next 30 Years of the Web, by Justin Bingham (CTO of Janeiro Digital) and John Bruce (Co-founder and CEO of Inrupt). They presented a whole new way of storing data, one that breaks with the current privacy paradigm, and their approach merits discussion beyond a single tech conference.
DECENTRALIZING THE WEB
Data is at the core of the internet, which is essentially an exchange of information between two endpoints. However, as the internet has evolved, the way data is exchanged has shifted significantly from the intentions of the inventor of the World Wide Web, Sir Tim Berners-Lee, who had envisioned an internet where information exchange did not include the transfer of actual data to the requesting party. Instead, he believed data would only exist with its owner and the internet would consist of links to it for reading and writing purposes.
That’s why Berners-Lee started the Solid project, which re-introduces his original idea of a decentralized approach to the internet. Personal data is kept by the individual user and not stored centrally with each service supplier.
Backed by the company Inrupt, the project introduces a more peer-to-peer internet with Personal Online Data Stores (Pods) for everyone. The Solid network is conceptualized entirely around these Pods, each of which contains all the data of one person, whether it be bank account details or the latest social posts. With a Pod, the data referring to you is fully owned by you, whereas today this data resides with your bank and the social platform itself. Both Inrupt and people within the Solid Community provide Pods that run on their respective servers, but you can also create your own Pod on your own server for ultimate privacy. There is no central owner of these Pods since this would undermine the Solid principle.
ONE SINGLE INTEGRATION
With Solid and Pods, all services, from your favorite taxi company to your insurance company, would communicate with your personal data through one API. Each service would have its own read and write permissions for different parts of that data, and all of them could read and write at the same time. To cater to this, Inrupt started working with Janeiro Digital to create an open standard that all applications can work with. The beauty of this is that applications only need to learn one standard and integrate with the Pod to provide a data-driven service. Integrations between different services are no longer required.
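To make the idea of per-service permissions more concrete, here is a minimal sketch in Python. It is a toy, in-memory model and not the Solid API: the `Pod` class, the `grant`, `read`, and `write` methods, the path names, and the application names are all hypothetical and chosen purely for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class Pod:
    """Toy in-memory stand-in for a personal data store (NOT the Solid API)."""
    owner: str
    data: dict = field(default_factory=dict)    # path -> stored value
    grants: dict = field(default_factory=dict)  # (app, path) -> set of modes

    def grant(self, app: str, path: str, *modes: str) -> None:
        """The owner gives an app 'read' and/or 'write' access to one path."""
        self.grants.setdefault((app, path), set()).update(modes)

    def _check(self, app: str, path: str, mode: str) -> None:
        # Refuse any access the owner has not explicitly granted.
        if mode not in self.grants.get((app, path), set()):
            raise PermissionError(f"{app} has no {mode} access to {path}")

    def read(self, app: str, path: str):
        self._check(app, path, "read")
        return self.data.get(path)

    def write(self, app: str, path: str, value) -> None:
        self._check(app, path, "write")
        self.data[path] = value


# Hypothetical setup: two services, each limited to its own part of the Pod.
pod = Pod(owner="alice")
pod.grant("taxi_app", "location/home", "read")
pod.grant("insurer", "policies/car", "read", "write")

pod.write("insurer", "policies/car", {"premium": 40})
print(pod.read("insurer", "policies/car"))

try:
    pod.read("taxi_app", "policies/car")  # the taxi app was never granted this
except PermissionError as err:
    print("denied:", err)
```

The point of the sketch is that both services talk to the same interface, while the owner decides which part of the data each one may touch.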
Imagine writing an application that combines and shows posts from different social networks. Today, one would have to integrate with each of those networks separately to retrieve the data. Instead, if each of these social networks stored its posts in the Pod, the Pod’s owner could simply grant this new application access to all posts, reducing the number of integrations to a single one. Furthermore, if this new application wants to combine posts with other personal data, it could easily grab that information from your Pod. Want to create the new Facebook? No problem. All the historical data is available in your Pod. No need to migrate.
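As a rough illustration of that single integration, the sketch below merges posts that several networks have written into one Pod. Everything in it is an assumption made for the example: the `pod_posts` layout, the network names, the sample posts and dates, and the `combined_timeline` function are invented and do not come from Solid or from the talk.

```python
from datetime import date

# Hypothetical Pod contents: each network stores its posts under its own key.
# The posts and dates are made-up sample data.
pod_posts = {
    "network_a": [
        {"date": date(2019, 3, 9), "text": "Second post"},
    ],
    "network_b": [
        {"date": date(2019, 3, 10), "text": "Third post"},
        {"date": date(2019, 3, 8), "text": "First post"},
    ],
}


def combined_timeline(posts_by_network, allowed_networks):
    """Merge posts from every network the owner allowed, newest first."""
    merged = [
        {**post, "source": network}
        for network, posts in posts_by_network.items()
        if network in allowed_networks
        for post in posts
    ]
    return sorted(merged, key=lambda post: post["date"], reverse=True)


# The owner grants the new application access to both networks' posts.
timeline = combined_timeline(pod_posts, allowed_networks={"network_a", "network_b"})
for post in timeline:
    print(post["date"], post["source"], post["text"])
```

The application never talks to the networks themselves. It reads one store, and the set of allowed networks stands in for whatever access the owner chose to give.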
BIG DATA & AI
Although SOLID is known to many as an acronym for software development principles, it got a whole new meaning here at SXSW: Social Linked Data. With a single point of integration, as described above, this makes perfect sense. However, it also raises the question: how would this fit within the world of big data, machine learning, and AI? All of these concepts rely heavily on centralized storage, and Pods are anything but that, especially when Pods are hosted all over the world, with no guarantees on network availability.
So, if data cannot be accumulated and needs to be fetched and interpreted across millions of Pods, how would it be possible to perform any machine learning without a significant performance penalty? And even if the data could be replicated and combined with more data, wouldn’t this contradict the whole idea of Solid in the first place? And even if that were feasible, if only temporarily, wouldn’t people refuse access for the purpose of data mining and only allow access for the primary purpose of the service?
THE BIG PLAYERS
The above questions apply mostly to the big players, the companies that serve a huge chunk of the current centralized internet. These companies rely on possessing our data. The majority of their turnover, which drives their shareholder value, is based on the data they collect from us; data that they will never willingly give up for the greater good. As long as these companies treat control over private data as a crucial element of their business, they will not embrace initiatives like Solid, where data owners can decide how their data is used. For most companies, the centralized approach is simply more convenient.
On the other hand, Bruce and Bingham also explained how Pods can bring new benefits to companies and customers by providing instant access to more data. One example is the combination of wearable data with an insurance policy, where the step counter of your smartwatch could trigger a lower premium. This is an interesting view, but somewhat oversimplified, since it’s likely that the user would also have to consent to sharing other data, such as food purchases, which could eventually be used to increase the premium. All in all, it is quite likely that companies will use Pods to bundle consents, where certain services will only be made available if consent to other data is given as well. It is up to you to decide if the benefit is worth the trade-off. But how fragile will this freedom of choice be when it comes to basic services like healthcare?
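The consent trade-off can be sketched in a few lines of Python. This is a hypothetical model only: the `REQUIRED_CONSENTS` catalogue, the service names, and the consent labels are invented to mirror the insurance example and do not describe any real insurer or any Solid feature.

```python
# Hypothetical catalogue: service name -> consents the provider demands.
REQUIRED_CONSENTS = {
    "basic_policy": {"identity"},
    "discounted_policy": {"identity", "step_count", "food_purchases"},
}


def unlocked_services(granted):
    """Return the services whose every required consent has been granted."""
    granted = set(granted)
    return sorted(
        name
        for name, required in REQUIRED_CONSENTS.items()
        if required <= granted  # subset test: all demanded consents are given
    )


# Sharing the step counter alone is not enough for the discount...
print(unlocked_services({"identity", "step_count"}))
# ...the provider also wants the food purchases before it unlocks the offer.
print(unlocked_services({"identity", "step_count", "food_purchases"}))
```

In this model the user keeps the final say, but the provider decides how the consents are bundled, which is exactly where the freedom of choice becomes fragile.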
The beauty of Solid lies in its simplicity, but that same simplicity highlights that it is not compatible with current, complex website structures and their profit model of collecting data. The internet has become extremely vast and consists of many established platforms. Trying to change that will take an enormous amount of time, development effort, and most of all, goodwill. A completely new approach that sidelines all existing applications can only succeed if it grows to a similar size or bigger. Still, the Solid project is young, and hopefully it will gain a lot of traction. Since the start of Inrupt, it has already attracted a lot of attention, so the potential is there.