Just in:
Building Green Floors: ADNEC Group, Terrax Partner on Sustainable Event Flooring // Abu Dhabi Gears Up for Domestic Tire Production with Multi-Billion Dirham Investment // Maximizing Mileage: Tips for Prolonging Your Vehicle’s Lifespan // UAE, Iraq Discuss Strengthening Ties // Crypto Wallet Urges iPhone Users to Disable iMessage Over Unpatched Vulnerability // Andertoons by Mark Anderson for Tue, 16 Apr 2024 // Urgent Plea for De-escalation in the Region Issued by the UAE // CUHK Tops QS World University Rankings, Solidifying Its Global Research Leadership: Secures Top Positions in Hong Kong with 8 Subjects and 19* Subjects Among Top 50 // Emirati Women Leaders Gather to Celebrate Eid al-Fitr with Fatima bint Mubarak // Strengthening Ties: UAE and Malaysia Forge Path for Broader Cooperation // Abu Dhabi Police on High Alert for Potential Low-Pressure Weather System // Tall & Active: Finding the Perfect Activewear Fit // Microsoft, UAE AI firm to invest $1.5 billion // Digital Gathering Spaces: Crafting Vibrant Community Websites // Aramco Vice President addresses Aramco’s sustainability initiatives at One Earth Summit // A Taste of Morocco Arrives at Dubai’s Global VillageThe aromatic spices and vibrant culture of Morocco have arrived at the Dubai Global Village, as the Moroccan pavilion officially opened its doors to the public. Spanning an impressive space, the pavilion promises to transport visitors to the heart of Morocco, offering a captivating glimpse into the country’s rich heritage, traditional crafts, and delectable cuisine.Stepping into the pavilion is akin to stepping onto the bustling streets of a Moroccan marketplace. The air is filled with the enticing aroma of fragrant tagines and freshly brewed mint tea, whetting the appetites of visitors. Colorful displays of intricately woven textiles, handcrafted pottery adorned with Berber designs, and gleaming brass lamps line the stalls, each piece a testament to the meticulous skill of Moroccan artisans.Visitors can embark on a sensory journey through Morocco, exploring the vibrant culture and traditions of the North African nation. Those seeking a retail adventure can browse through a curated selection of Moroccan goods, including hand-woven rugs, leather goods, and babouche slippers, all reflecting the country’s unique blend of Arabic, Berber, and European influences.Beyond shopping, the pavilion offers a chance to immerse oneself in Moroccan culture. Live music performances featuring traditional instruments like the oud and the darbuka fill the air, transporting visitors to a vibrant Marrakech marketplace. Artisans showcase their skills, demonstrating the age-old techniques of carpet weaving, pottery making, and metalwork, offering a glimpse into the heart of Moroccan craftsmanship.For those seeking a culinary adventure, the pavilion boasts a variety of restaurants serving up authentic Moroccan delicacies. Visitors can savor the fragrant flavors of tagines, simmered meats and vegetables in a conical clay pot, or sample the fluffy sweetness of baghrir, a type of semolina pancake drizzled with honey and argan oil. No Moroccan experience is complete without a steaming cup of mint tea, traditionally poured from a height to create a foamy head.The Moroccan pavilion at the Dubai Global Village is more than just a marketplace; it’s a portal to a captivating culture. Whether you’re tertarik (attracted) to the intricate craftsmanship, enticed by the flavorful cuisine, or captivated by the lively music, the pavilion offers a chance to experience the magic of Morocco firsthand. // World Trade Charts New Course After Three Decades // FAB Makes Record-Breaking Profit in 4th Quarter // Microsoft Pours $1.5 Billion into UAE AI Leader G42 // UK Poised for Crypto Regulations by July //
HomeTAP ResearchArtificial data give the same results as real data—without compromising privacy

Artificial data give the same results as real data—without compromising privacy

1488804798 artificialda

Credit: Massachusetts Institute of Technology

Although data scientists can gain great insights from large data sets—and can ultimately use these insights to tackle major challenges—accomplishing this is much easier said than done. Many such efforts are stymied from the outset, as privacy concerns make it difficult for scientists to access the data they would like to work with.


In a paper presented at the IEEE International Conference on Data Science and Advanced Analytics, members of the Data to AI Lab at the MIT Laboratory for Information and Decision Systems (LIDS) Kalyan Veeramachaneni, principal research scientist in LIDS and the Institute for Data, Systems, and Society (IDSS) and co-authors Neha Patki and Roy Wedge describe a machine learning system that automatically creates synthetic —with the goal of enabling efforts that, due to a lack of access to real data, may have otherwise not left the ground. While the use of authentic data can cause significant privacy concerns, this synthetic data is completely different from that produced by real users—but can still be used to develop and test data science algorithms and models.

ADVERTISEMENT

“Once we model an entire database, we can sample and recreate a synthetic version of the data that very much looks like the original database, statistically speaking,” says Veeramachaneni. “If the original database has some missing values and some noise in it, we also embed that noise in the synthetic version… In a way, we are using machine learning to enable machine learning.”

The paper describes the Synthetic Data Vault (SDV), a system that builds machine learning models out of real databases in order to create artificial, or synthetic, data. The algorithm, called “recursive conditional parameter aggregation,” exploits the hierarchical organization of data common to all databases. For example, it can take a customer-transactions table and form a multivariate model for each customer based on his or her transactions.

ADVERTISEMENT

This model captures correlations between multiple fields within those transactions—for example, the purchase amount and type, the time at which the transaction took place, and so on. After the algorithm has modeled and assembled parameters for each customer, it can then form a multivariate model of the these parameters themselves, and recursively model the entire database. Once a model is learned, it can synthesize an entire database, filled with artificial data.

Outcome and impact

After building the SDV, the team used it to generate synthetic data for five different publicly available datasets. They then hired 39 freelance data scientists, working in four groups, to develop predictive models as part of a crowd-sourced experiment. The question they wanted to answer was: “Is there any difference between the work of data scientists given synthesized data, and those with access to real data?” To test this, one group was given the original data sets, while the other three were given the synthetic versions. Each group used their data to solve a predictive modeling problem, eventually conducting 15 tests across 5 datasets. In the end, when their solutions were compared, those generated by the group using real data and those generated by the groups using synthetic data displayed no significant performance difference in 11 out of the 15 tests (70 percent of the time).

These results suggest that synthetic data can successfully replace real data in software writing and testing—meaning that data scientists can use it to overcome a massive barrier to entry. “Using synthetic data gets rid of the ‘privacy bottleneck’—so work can get started,” says Veeramachaneni.

This has implications for data science across a spectrum of industries. Besides enabling work to begin, synthetic data will allow data scientists to continue ongoing work without involving real, potentially sensitive data.

“Companies can now take their data warehouses or databases and create synthetic versions of them,” says Veeramachaneni. “So they can circumvent the problems currently faced by companies like Uber, and enable their data scientists to continue to design and test approaches without breaching the privacy of the real people—including their friends and family—who are using their services.”

In addition, the model from Veeramachaneni and his team can be easily scaled to create very small or very large synthetic data sets, facilitating rapid development cycles or stress tests for big data systems. Artificial data is also a valuable tool for educating students—although real data is often too sensitive for them to work with, synthetic data can be effectively used in its place. This innovation can allow the next generation of data scientists to enjoy all the benefits of big data, without any of the liabilities.


Explore further:
Combatting retail fraud using a simulator

More information:
“The Synthetic data vault”, dai.lids.mit.edu/SDV.pdf

Source link

ADVERTISEMENT

ADVERTISEMENT
Just in:
Hinen to Showcase Innovative Energy Solutions at Solar & Storage Live Australia 2024 // Emirati Women Leaders Gather to Celebrate Eid al-Fitr with Fatima bint Mubarak // HeeSay Launched ‘LivelyLaugh’ Campaign to Celebrate Songkran 2024, driving New Interactive Trends among LGBTQ+ People // Aramco Vice President addresses Aramco’s sustainability initiatives at One Earth Summit // DFS CIRCLE Celebrates First Anniversary: Journey to ‘Collect the World’ with Exclusive Gifts designed by the trending illustrator, matsui, and Destination-unique Collectibles! // Maximizing Mileage: Tips for Prolonging Your Vehicle’s Lifespan // FAB Makes Record-Breaking Profit in 4th Quarter // Ad Blockers Gain New Purpose in Fight Against Government Spyware // Microsoft, UAE AI firm to invest $1.5 billion // Ramdev, aide in Supreme Court today // Renowned Dutch Microbiologist and Expert in Water Quality and Health Named Lee Kuan Yew Water Prize 2024 Laureate // World Trade Charts New Course After Three Decades // United Terra Enterprises PLC proposed work plan for Visoka approved by regulatory body (AKBN) and state-run Albpetrol. // UK Poised for Crypto Regulations by July // Crypto Wallet Urges iPhone Users to Disable iMessage Over Unpatched Vulnerability // Abu Dhabi Gears Up for Domestic Tire Production with Multi-Billion Dirham Investment // Tall & Active: Finding the Perfect Activewear Fit // Microsoft Pours $1.5 Billion into UAE AI Leader G42 // UAE, Iraq Discuss Strengthening Ties // With record scale, China’s consumer products expo shares opportunities and market with world //