I have about 3 million documents that are pdfs, docs and images. I have build a website and if user search from website interface, I have to serve those hbase stored documents as required.
How can I do it?
Is it good to use hbase for serving web documents (in future these
documents will be further increased) ?
My hadoop version is 1.2.1 and hbase 0.94.
I prefer in this case to have a search server that index this data and the web will integrate with this search server api for example: Solr, is an open source search server.
Hope this helps.
Related
I'm new to Sitecore concepts, and I'm searching almost five days for an answer. I couldn't find what I'm looking for.
I'm trying to access Sitecore analytic data from a web service. I found a web service of Sitecore with using this document. I want to extract data that are about this. I believe this is a public demo site.
I want to access Web Api, then extract data and use it at my own project. Any idea?
When working with Sitecore 9, the API you want to use to connect to analytics data is known as xConnect. A secure connection is required using trusted certificates, so you cannot connect to an existing instance that somebody else setup like the Habitat demo you linked to.
xConnect is an abstraction API that allows you to collect and search all data in the xDB. The architecture fully supports both vertical and horizontal scaling of xConnect services separate from your Sitecore installations.
Resources
You can read more about xConnect here in the official developer documentation: https://doc.sitecore.net/developers/xp/xconnect/
There is an xConnect tutorial available here: https://doc.sitecore.net/developers/xp/getting-started/#tutorials-xconnect
I also have a small tutorial you can use on GitHub to start learning the concepts: https://github.com/jst-cyr/XConnectTutorial
So I'm working on a site that has pretty specific global site search functionality that utilizes GSS which, as many of you already know, is going away in April. I need to crawl the site and send XML over to Cloudsearch, but I'm kind of confused as to how to go about this and I haven't found much material on building a global site search using AWS Cloudsearch after scouring the internet for a couple of days. So far, I'm planning on crawling the site with Apache Nutch, but I would really appreciate some input.
Did you come across our blog? Index the web with AWS CloudSearch Index the web with StormCrawler (revisited). I described how to use Nutch and StormCrawler to index to AWS Cloudsearch.
If you need the search to be hosted, I'd recommend Elasticsearch and Elastic Cloud instead. I found Cloudsearch slow, cumbersome and expensive and also there are more resources for Elasticsearch for StormCrawler and Apache Nutch.
Is it possible to create a product on amazon programmatically using the api? I would ideally like to create a product using the spree commerce CMS and at the end of the day sync products between the CMS and an online amazon store.
On the Amazon side, you can create products using the Amazon Marketplace Web Service, in particular the feeds API (code samples here).
On the Spree side, you can make this happen by either writing an extension, or by using Spree's middleware product (Wombat) and creating a custom integration for it.
Spree is pushing hard for more integrations that have broad appeal, and syncing with Amazon MWS definitely fits the bill, so you may be able to contact them and get some help doing these things (my company is a Spree partner, and we're currently working on an open source integration to a popular CRM tool for another seller).
Can someone please tell me the step that I should take to integrate google docs in .cfm page. It is a multi-user web application with single log-in , one username n password.
Thanks.
Try this free library at: http://cfgoogle.riaforge.org/ , read this post: http://www.coldfusionjedi.com/index.cfm/2007/12/8/Google-Docs-CFC
If you want to do it yourself without using the library above, you will need to learn how to consume RESTful services using the <cfhttp> tag, and construct and parse XML using <cfxml> and xmlParse().
This Yahoo guide introduces how to use CF to consume Yahoo's RESTful services: http://developer.yahoo.com/coldfusion/index.html
Forgive my ignorance, but do people build websites with Microsoft Dynamics CRM?
I have a potential client who says that is the technology they will use for a new web project, for which I would be doing the HTML templating. I want to learn all I can as I am new to this particular system, but I can't seem to find anything related to web building and CRM. Is it more likely the client is using another piece of technology that they are neglecting to tell us about?
Any experience or insight about this process is greatly appreciated!
You can't build website using MSCRM, but you can build ASP.Net website using CRM content easily. The API provided by MSCRM is a set of web services, that can be queried in a website to populate pages content.
Also, you can directly query the Database if you want ReadOnly content, and only for read-only content, because it is not supported to update/create data using SQL connections.
Note that it's required to have the external connector license to use any data in the CRM, thru the API and even thru direct SQL queries.
I have built a website using .NET, utilising the CRM4 API and database. All the website did was to submit queries to CRM using the CRM web services and metadata web services, and get data back. This company was already using CRM for their back end, so it's much easier for the front end to use the same database as well since it's a simple one.
The alternative would be to create an intermediate connector between the 2 systems -- backend and front end, which is not as efficient (for our case anyway).