(I am knowingly sending this to the whole list to spark some conversation.)
The solid state drive (SSD) question is a good one in general. And, specifically, what impact SSD will have on the Neteezas, Teradatas and SQL Server Parallel Data Warehouse editions of the world? Those products
seem premised on spinning disks. Is that correct? Can these products adapt? Is MPP as necessary when everything's SSD and seek times cease to be an issue?
More info on cloumnar db technology would be great. SSD has impact there too -- since solid state drives -are- memory, what's the benefit of loading a db -in- memory?
These are futures questions since the economics of SSD for Big Data still seem prohibitive...but I imagine that will be the case for only so long...
From: [address removed] [[address removed]] On Behalf Of Yuriy Michael Goldman [[address removed]]
Sent: Wednesday, July 21, 2010 1:05 PM
To: [address removed]
Subject: [nybi] August break, upcoming events and your feedback desired --
Hello folks -
No meetup in August - it is too hot.
We will continue our segment on high-performance analytical DBs in September with a
presentation from INGRES. They claim to outperform a small ParAccel or AsterData cluster with a single Server. Clearly this is a very different view of the MPP space and definitely of interest
to us.
Following this event
we will host GotoMetrics (
http://gotometrics.com) - a young startup with a focus on
visual analytics of terabyte data. We know that existing Reporting solutions can sit on top of a database cluster. For example, SpotFire can sit on top of Vertica. JasperSoft can sit on top of Ingres and SAS can send work through AsterData, etc.
What innovation is GotoMetrics offering the marketplace?
I've spoken to the team a few times now and they are a very bright group. In preparation for our session with them I want to pass along a question:
what are your knowledge gaps within the, DataWarehousing, Analytics space?
Barry Zane from ParAccel did an excellent job explaining existing bottlenecks in classical analytics DBs and the motivation for columnar, highly-compressed, and clustered implementations that are clever to balance work load, perform in-memory calculations and
even support non-sql expressions (ie AsterData's MapReduce/SQL language).
Which areas shall GotoMetrics focus on? For example, is there interest in better understanding of Columnar vs Row implementation of DBs? Do you want to learn more about the role of Solidstate storage
in the market? Perhaps a deeper dive on different Databases in the market place along with their strengths/weaknesses? How about an exploration of architectures? For example, Vertica does not have "queen bee"/loader nodes while ParAccel does; in Vertica,
all nodes participate in calculations. What are the architectural characteristics at play? What are some challenges with Visualizations when dealing with multi-tarabyte data?
Please send through your ideas, comments and questions.
Kind regards,
Yuriy Goldman
NYBI Meetup Organizer
--
Please Note: If you hit "
REPLY", your message will be sent to
everyone on this mailing list ([address removed])
This message was sent by Yuriy Michael Goldman ([address removed]) from
The New York Business Intelligence Meetup.
To learn more about Yuriy Michael Goldman, visit his/her
member profile
To unsubscribe or to update your mailing list settings,
click here
Meetup, PO Box 4668 #37895 New York, New York 10163-4668 | [address removed]