Showing posts with label adMarketplace. Show all posts
Showing posts with label adMarketplace. Show all posts

Tuesday, February 03, 2015

HP Vertica Enables Rapid Matching of Consumer Inferences to Ads at Huge Scale for adMarketplace

Transcript of a BriefingsDirect podcast on how big data and data analytics combine to instantly match search users with ads appropriate to their interests.

Listen to the podcast. Find it on iTunes. Download the transcript. Sponsor: HP.

Dana Gardner: Hello, and welcome to the next edition of the HP Discover Podcast Series. I'm Dana Gardner, Principal Analyst at Interarbor Solutions, your host and moderator for this ongoing sponsored discussion on IT innovation and how it’s making an impact on people’s lives.

Once again, we're focusing on how companies are adapting to the new style of IT to improve IT performance and deliver better user experiences, as well as better business results.
Become a member of myVertica today
and gain access to the
FREE HP Vertica Community Edition
This time, we're coming to you directly from the recent HP Big Data 2014 Conference in Boston. We're here to learn directly from IT and business leaders alike how big data, cloud, and converged infrastructure implementations are supporting their goals.

Gardner
Our next innovation case study interview highlights how adMarketplace is using big data to improve its search advertising capabilities for its customers. [See part one of this two-part series.] We'll learn more about search for advertising and how big data plays a role in that in our discussion with our guest, Raj Yakkali, Director of Data Infrastructure at adMarketplace in New York. Welcome, Raj.

Raj Yakkali: Hello, Dana. Very nice to be here, and thank you.

Gardner: Good to have you here. Tell us a little bit about adMarketplace. What do you do, and why is big data such a big part of that?

Yakkali: adMarketplace is a leading advertising platform for search intent. We provide the advertisers with the consumer space where they can project their ads. The benefit of the adMarketplace comes into play where we provide a data platform that can match those ads with the right user intent.

Yakkali
When user searches for a certain keyword, they're directly telling us what they want to see, and we match it perfectly well with our ads. The relationship that we have with our advertisers is that we match them well and make it accessible in exactly what the user is thinking. We do some predictive analytics on top of what the user is saying. We add that dimension to our user search and provide ads aptly.

Gardner: I'm all for getting better ads based on lot of things I already get. Do you have more than just keywords in terms of how you can draw inference, and what sort of scale of data are we talking about when it comes to all that inference information about an intent on behalf of the consumer?

15 dimensions

Yakkali: Keyword search is one side or one dimension of the user search. There are also category campaigns that the advertisers are running. At the same time, there's a geospatial analysis to it as well. There are 15 dimensions that we go through to provide an ad that is perfectly fit for the advertiser and for the consumer to see and take advantage of to meet their needs. With some of the ads, we are trying to serve the user’s requirements and needs.

http://bit.ly/1sWpHmCGardner: With all these variables, this sounds like you're going to be gathering an awful lot of information. You also need to reply back with your results very fast or you lose the opportunity for that consumer to get the ad and then even click through and make a decision. Tell me about scale and speed.

Yakkali: You're right on with that question. In this business, latency is your enemy. If you look into the certain metrics, there are almost a half a billion requests that we're receiving every day and we have to match all of those ads with a sub-second performance. We have internal proprietary datasets, which we take care of before matching these ads. And there are two platforms that we've built internally.

One is called Bid Smart. That performs the analysis between the user intent and the traffic sources that the user search is coming from. At the same time, the price of that ad goes to the publisher. There are the pricing strategies, the traffic sources, and the user intent of the search. All of these things are put together. That predictive analytics system gathers all this information and emits the right ad towards the consumer.
With the partnership with Vertica, we’re able to take the dataset, derive analytics about it, and provide our marketers with all that information.

On top of it, if you look into the amount of data, those half a billion requests that are coming into our system, it generates around two terabytes per hour. At certain times, we can't store all of it for analytics. There is a lot of data that's not inside the database. Now, with the partnership with Vertica, we’re able to take the dataset, derive analytics about it, and provide our marketers with all that information. Bid Smart is the one that does the pricing and matching.

The other thing is Advertiser 3D, which provides that detailed analytics into all these dimensions on the metrics. That provides a very good insight. Now, when it comes to the competition or the opportunity to deliver the right ad at the right time, that's where data work flows make a difference.

We utilize Vertica to directly stream all this click data into it, rather than going into certain other locations and then doing it in a batch format. We directly live-stream that data into Vertica, so that it is readily available for analytics. Our Bid Smart System makes use of that dataset. That's where we get the opportunity to deliver much better ads, with price tags, and the right user intent matched.

Gardner: It sounds very complex. There's an awful lot going on for just serving up an ad. I suppose people don’t appreciate that, but the economics here are very compelling, the more refined and appropriate an ad can be, the more likely the consumer is to buy, but there are a lot of resources that don't get wasted in the meantime. Do you have any sense of what the payoff is, either in business, financial, or technical terms for when you can really accomplish this goal of targeted advertising?

Conversion rate

Yakkali: So our conversion rate is a major key performance indicator (KPI) when it comes to understanding how well we are doing. When we see higher conversion rates, that gives us the sense that we've done the best job and user is happy with what they are searching and what they are getting.

At the at the same time, the publishers, as well as the advertisers, are happy, because the user is coming to us again and again to get that similar, beautiful experience. The advertisers are able to sell more products that meet the needs of the user. And the users are able to get the product that really caters to their needs. We're in the middle of all these things, trying provide the facilitation to the advertisers, as well as the users and the publishers' space.

Gardner: I daresay this is the very future of advertising. Now for you to accomplish these goals and create those positive KPIs, are you housing Vertica in your own data center, do you use cloud, hybrid cloud? Given that you have different platforms, different datasets, how do you manage this technically?

Yakkali: On that end, we started with testing cloud two or three years ago, but again, it turned out that because of so many unknowns and troubleshooting, we had to go with our data centers. Now, we host all our systems in our own data centers and we manage it.

We have our own hardware to deal with. Our system is a 24/7, and we have to be able to deliver the sub-second latency performance. Having your own infrastructure, you have the controlled environment where you can tweak and tune your system to get the best performance out of it.
Become a member of myVertica today
and gain access to the
FREE HP Vertica Community Edition
Considering that it is a 24/7, there are fewer excuses that you can get away with in not delivering it. For that, we do innovation in terms the data flows and the process of how we ingest the data, how we process the data, how we emit the data, and how we clean up the data when we don’t really need it.

All these things have to come together, and it really helps us having that control on all of our infrastructure and all elements in the data pipeline, starting from the user intent and user search, until we provide the data and the results.

Gardner: How long have you been using Vertica in this regard and how did you go about making that decision?

Yakkali: We've been using Vertica for four to five years and our data pipeline was not on Vertica to start with, but as Vertica came into the picture and we saw the great beauty and the powerful features that it brings to capitalize our ability.

That really helped us. With Vertica in place, we have been migrating our mechanics slowly to use it for the real-time analysis and real-time bidding and all those beautiful features that make us do what we can do better. So it’s been a great partnership with Vertica and we see many more features coming in with the new version. Our Bid Smart mechanism is also improving, and with that, algorithmic capabilities are increasing. So it’s progressing.

Feedback loop

Gardner: Tell us a little bit about where your business is heading. In addition to speed, complexity, and scale, where do you see the ability to create this feedback loop? It’s very rapid feedback loop between a lot of incoming data and an action like streaming up an ad. It seems like this could be applied to either other marketing or advertising chores or perhaps even have an ancillary business-development direction. You’ve got this platform and these data centers. Is there something else that you're gearing up for?

Yakkali: At this point, we're in the business of connecting the advertisers, the publishers, and the users. But that is an untapped business to what it can accomplish. The market has started its pathway towards the level of reaching that epitome. If we take a step back and try to understand it, initially, when search started, there was no Google or anything. It was more about curated search.

So the publishers put out all this content together and then projected it out to the user. They didn't know what user wanted. At the same time, when the user looked at this content, they didn't know whether they want it or whether it catered to their needs.

Then, Google came along and user search started. What that directly told was "I want this piece of information. I want to use this piece of information. And I want to see this ad that is relevant to my needs." That’s a very powerful thing. When you hear that part, you're able to analyze that piece and match it properly with the advertisers. But then again, it started to fragment.
At this point, we're in the business of connecting the advertisers, the publishers, and the users.

Now, it’s not only Google. There is Yahoo, Bing, there is mobile, and there are certain apps. There are many apps in the mobile space and each one has its own search. So not all the searches are going to Google, Yahoo, or Bing. Search is already fragmented.

We tap all those pieces. The market that is beyond Google. Yahoo-Bing is stronger and it is growing. So there is a lot of market that needs to be tapped into. We come into the place connecting the advertisers to tap that untapped marketplace.

We've been improving our internal Bid Smart algorithm that came out in the last year. Then, we also launched Advertiser 3D last year as well. Those two products have been providing tremendous growth in our revenue, and the retention rates have been stellar.

The top 60 percent of Google’s top spenders are working with us to complement their business. At the same time, we're also able to provide 50 percent increase in year-over-year revenues. It's additional revenue for them, and even our revenues are increasing based on that fact.

Gardner: It seems like you have an awful lot of runway ahead of you in terms of where search could be applied, and analytics can be drawn from that to augment these services and explode that market.

Is Vertica being used just for the intercept between the incoming data and the outgoing ad, or you are also analyzing what goes on within these marketplace so that you better appreciate, whether you can offer reports, audit trails, and that sort of thing? Is this an inclusive platform, or do you use different analytics platforms for different aspects of what you are doing?

End to end

Yakkali: We do almost everything. It is an end-to-end platform. As part of the business we look into the operational metrics of the whole thing, starting from the user search until the ad is delivered. Then, from that end, there is always that analytics piece that comes onto play, which provides insights to the marketers.

Our market base is filled with the very data-savvy marketers, and they look into each and every data dimension to understand their return on investment (ROI). We give them transparency through our Advertise 3D System and utilizing that, they're able to navigate through the space and aptly tune their campaigns to get the best out of it and to deliver the best to the customer.

Gardner: Any thoughts about other organizations that are also facing significant challenges around speed, scale, also perhaps with a big runway, in terms of knowing that more and more business could be coming their way therefore more data? What would you advise them in terms of the data architecture or the planning in order to accomplish the goals?

Yakkali: When we look at the industries and the market, the ad industry still is untapped. The healthcare industry is just getting into the business of doing much more with analytics. It’s all about the speed and the latency and the insights as well. One, at the operational level and the other, at the insight level to do more innovation on top of it.
Our market base is filled with the very data-savvy marketers, and they look into each and every data dimension to understand their return on investment (ROI).

The ability to listen to the customer depends on how fast you can capture all that feedback, and you tighten that loop of feedback so that you're able to do something with it and make a better product out of it.

So it’s all about taking a look at the datasets very closely as to what they mean, what the user is asking us, what do they want to see, and how you are listening to the customer. Those two aspects really make the difference.

You want to listen to the customer, what they really want. Are you providing it and are you able to guess what they want for tomorrow for that predictive, and going into prescriptive analytics, phase later on. You're telling them what they need to do even before they tell you.

That's the stage that the market is going towards. We're not even scratching the surface of prescriptive analytics. The wave has not yet started towards that route. We're still at the predictive analytics phase, and there is still a lot more to go within that space. Get the foundation stronger, drive towards prescriptive analytics, and listening to your customer, are the three aspects that would make any industry. Those three would be the key foundational pieces for making innovation.
Become a member of myVertica today
and gain access to the
FREE HP Vertica Community Edition
Gardner: Thanks so much. We've been learning about how adMarketplace is using big data to perform some very complex marketing activities for their advertisers to match intent from a customer with an ad that suits their needs based on ever-growing amount of data and inference. [See part one of this two-part series.] I'd like to thank our guest. We've been joined Raj Yakkali, Director of Data Infrastructure at adMarketplace in New York. Thanks so much.

Yakkali: Thank you very much, Dana. It was a pleasure talking to you.

Gardner: And I'd like to thank our audience as well, for joining us for the special new style of IT discussion, coming to you directly from the recent HP Big Data 2014 Conference in Boston.

I'm Dana Gardner, Principal Analyst at Interarbor Solutions, your host for this ongoing series of HP sponsored discussions. Thanks again for listening, and come back next time.

Listen to the podcast. Find it on iTunes. Download the transcript. Sponsor: HP.

Transcript of a BriefingsDirect podcast on how big data and data analytics combine to instantly match search users with ads appropriate to their interests. Copyright Interarbor Solutions, LLC, 2005-2015. All rights reserved.

You may also be interested in:

Thursday, January 22, 2015

adMarketplace Solves Search Intent Challenge with HP Vertica Big Data Warehouse Solution

Transcript of a BriefingsDirect podcast on how an consumer intent search company is able to handle massive amounts of data and analyze it quickly with HP Vertica.

Listen to the podcast. Find it on iTunesDownload the transcript. Sponsor: HP.

Dana Gardner: Hello, and welcome to the next edition of the HP Discover Podcast Series. I’m Dana Gardner, Principal Analyst at Interarbor Solutions, your host and moderator for this ongoing sponsored discussion on IT innovation and how it’s making an impact on people’s lives.

Gardner
Once again, we're focusing on how companies are adapting to the new style of IT to improve IT performance and deliver better user experiences, as well as better business results.

This time, we're coming to you directly from the recent HP Discover 2014 Conference in Las Vegas. We're here to learn directly from IT and business leaders alike how big datacloud, and converged infrastructure implementations are supporting their goals.

Our next innovation case study interview explores how New York-based adMarketplace, a search syndication advertising network, has met its daunting data-warehouse requirements.
and gain access to the
FREE HP Vertica Community Edition
We'll learn how adMarketplace captures and analyzes massive data to allow for efficient real-time bidding for traffic sources for online advertising. And we'll hear how the data-analysis infrastructure also delivers rapid cost-per-click insights to advertisers.

To learn more about how adMarketplace manages its big-data challenges, please join me in welcoming Michael Yudin, the Chief Technology Officer at adMarketplace.

Michael Yudin: Hello. Thank you, Dana.

Gardner: Tell us first about what adMarketplace does. It sounds very interesting, but I'm not sure I fully understand it.

Yudin: Well, adMarketplace is the leading marketplace for search intent advertising, and let me explain what that means. Search advertising is the best form of advertising ever invented. For the first time, a consumer actually tells a computer what they're interested in. That’s why Google became so successful as a search engine.

Yudin
Some things are changing in the marketplace these days. Consumer search intent is fracturing. You probably wonder what this means. It’s very simple. What this means is Google is no longer the only place you go to search for stuff.

I'll give you an example. Last night, I was looking for a Brazilian steakhouse here in Las Vegas. I didn't go on google.com. I opened my iPhone and I fired up a yellow pages (YP) app and I entered "Brazilian steakhouse" in the search box.

There are a variety of apps in my phone like that for travel, sports, news, and various other things I'm interested in. Anytime I search there, I don’t go to google.com. Consumer search has really fractured and adMarketplace has solved the monetization problem for that.

Providing value

Gardner: So when people are searching in areas other than say Google or Yahoo, how does your organization intercept with that and how does that provide value to both the consumer that’s searching and advertisers that want to provide them information?

Yudin: It benefits both the consumer and the advertiser. In the search world, an ad is really nothing more than a search result in response to user’s query. That’s why it’s so great.

Our clients are the Internet's largest marketers and brands. They use adMarketplace to acquire additional customers in addition to the other marketing channels like Google, where they are pretty much already maxed out.

http://bit.ly/1En8DHKThere are only so many searches that happen in Google and they're declining. So advertisers are looking for new ways to capture consumer intent and to convert this into sales and measurable return on investment (ROI), and that's what we do for them.

Gardner: Of course, a really important thing here is to match properly, and that requires data and analysis -- and it requires speed. Tell us a little about the requirements. How do you do this technically?

Yudin: You just nailed it. This is a very, very big data problem and it has to be solved at scale and fast. And it’s also a 24x7 problem. We can never take our system down. We have a global business, and anytime you go and you search for something as a consumer, you expect to see the result right away.
and gain access to the
FREE HP Vertica Community Edition
Our network handles about half a billion search queries per day and this results in about two terabytes of data per hour constantly generated by our platform, across multiple data centers. We needed a very scalable and robust analytical data warehouse solution that could handle this. Two years ago, we evaluated a number of vendors and settled on HP Vertica, which was best able to satisfy our tough requirements.

Gardner: And are these requirements primarily about the scale and volume, or are we talking also about a need for rapid query, or all the above? Give us a bit more insight into the actual requirements for your network?

Yudin: That's a great question, and I think this is what makes Vertica unique. There are products out there that can store a lot of data, but you can't get this data out of these solutions quickly and at high concurrency. We require a system that can ingest large amounts of data constantly. I am talking about terabytes and terabytes of data. This data has to be queryable right away, with very low latency requirements.

Some of our queries for Advertiser 3D and analytical dashboard are preplanned queries obviously, but they are very big data queries and the service-level agreement (SLA) on these queries is two seconds. Very few products can do that. Some queries are obviously more complex, but we're still talking about seconds and not hours.

Concurrency requirement

On top of this, there's a concurrency requirement and that’s a very big weak spot of a lot of products. Vertica is actually able to provide sufficient concurrency, and it’s never enough.

I do know that there's an upcoming release of Vertica 7, where this is going to be improved even further, but it’s quite acceptable right now. And it has to be fault tolerant, which means that it should be able to sustain a hardware failure on any of its nodes -- and it can do that.

Gardner: Tell us a bit about where you've built Vertica in terms of data centers. Are they your own? Do you have managed service providers? How are you managing your infrastructure that supports Vertica and then therefore your data processes?

Yudin: We own our own infrastructure. So these are not managed services. We actually once used managed services, but we've outgrown them. And Vertica runs on dedicated hardware.
This was driven by business requirements. We didn’t just decide that we needed this

We also have several other Vertica clusters that run on virtualized hardware, and even though it’s dedicated infrastructure, it’s really dedicated at the cloud level now. So call it private cloud. It's a buzzword. It's a mix of dedicated and virtualized. It's elastic scaling.

Gardner: And the transition. You mentioned that two years ago, you were searching for a product. How were you able to bring this on board and what sort of growth have you had as a result -- in terms of data volume, but also in your business, in terms of customers and overall business metrics of growth?

Yudin: This was driven by business requirements. We didn’t just decide that we needed this. So we started to undertake a very, very ambitious project -- Advertiser 3D. If you go to our website, www.admarketplace.com, you can read more about it.

This is a very elegant, simple, and yet powerful, system to match and price traffic across a multitude of traffic sources. To deliver this product, we didn’t have a choice. We had to have a powerful analytical back-end data warehouse. That's when we started to evaluate products and chose Vertica.

Gardner: And have there been any other benefits of going to Vertica in terms of being able to increase the number of features, or have you been able to leverage the technology in new business opportunities in terms of what you can offer your customers, not just to have met the requirements, but perhaps whole new types of benefits?
and gain access to the
FREE HP Vertica Community Edition

Heavy lifting

Yudin: Definitely. Our customers don’t know and don’t even care that we use Vertica on the back end. That’s probably why we won an HP award, because we integrated it into our overall solution very elegantly and seamlessly, but it obviously does a lot of heavy lifting on the back end.

And the project was successful and transformed our business. Our growth rates have accelerated over 50 percent on our core revenue and performance. Data-savvy marketers, and our clients started to see significantly double-digit improvement in ROIs.

Gardner: As Chief Technology Officer there, you've gone through a fairly significant change in your infrastructure and adoption, as you've just described. Looking back, are there any lessons learned that you could offer to others who are also running into a wall with their data infrastructure or looking for alternatives? Any thoughts on how you would advise them to make the transition?
Our growth rates have accelerated over 50 percent on our core revenue and performance.

Yudin: Definitely. The number one advice I would give anybody is don’t believe anything until you do two things: Try it yourself and get references from people who actually use this and whom you trust. That's very important.

Gardner: Well, great. We've been talking about how adMarketplace captures and analyzes massive data to allow for efficient real-time bidding for traffic sources for online advertising.

I would like to thank our guest, Michael Yudin, the Chief Technology Officer at adMarketplace. Thanks so much.

Yudin: Thank you, Dana. My pleasure.

Gardner: And I also want to thank our audience as well for joining us for this special new style of IT discussion coming to you directly from the recent HP Discover 2014 Conference.

I'm Dana Gardner, Principal Analyst at Interarbor Solutions, your host for this ongoing series of HP-sponsored discussions. Thanks again for listening, and come back next time.

Listen to the podcast. Find it on iTunesDownload the transcript. Sponsor: HP.

Transcript of a BriefingsDirect podcast on how an consumer intent search company is able to handle massive amounts of data and analyze it quickly with HP Vertica. Copyright Interarbor Solutions, LLC, 2005-2015. All rights reserved.

You may also be interested in: