Yes, follow the steps below.
Sajari is a "custom HTML tag", which you can select when adding a new tag, as per below:
Sajari is typically a global install, so you can activate it across your entire site as per below. Don't worry about hiding various pages from your site search; you can easily exclude them later using "crawling rules".
You can access your install code from within the Sajari app itself (if logged in). Or you can cut and paste directly from below.
Copy the code on the left, then replace <project> and <collection> with your actual company and collection names.
Once you've completed the above steps, your content will begin to index and build statistics around popularity, recency, etc., to optimize your search and recommendations.
Our crawler may have encountered issues with your site during the initial crawl. If our crawler encounters errors, or if you have redirects or canonicals which cause redirect loops, we will abandon crawling.
Notes:
A canonical tag (aka "rel canonical") is a way of telling search engines that a specific URL represents the master copy of a page. This is done by setting the canonical tag in the head section of the page, as below.
Canonicals are used for a variety of reasons, such as choosing the preferred domain, http vs https preference, and consolidation of ranking "juice" for a given piece of content. Good canonicals can also help improve SEO. For more information, read how Google handles canonical tags and why the SEO community considers them important.
Canonicals are very important to the way Sajari works, and incorrect canonicals are one of the biggest reasons crawling fails to index content correctly. They are a very strong signal, and we generally won't index a URL if it has a canonical pointing elsewhere; we will instead try to index the canonical URL. The biggest mistakes we see with canonicals are:
You can tell if you have some of these issues using our content debug tool. You should either a) fix these issues or b) remove canonical tags from your pages altogether. Removing all canonicals is much better than setting them incorrectly.
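To make the canonical behavior described above more concrete, here is a minimal Python sketch. It is not Sajari's crawler code: the helper names (find_canonical, url_to_index) and the example URLs are hypothetical, and it only illustrates the rule that a page whose canonical points elsewhere is indexed under the canonical URL instead. The sample page shows a standard canonical tag in the head section.

```python
from html.parser import HTMLParser
from typing import Optional
from urllib.parse import urljoin


class CanonicalFinder(HTMLParser):
    """Records the href of the first <link rel="canonical"> tag seen."""

    def __init__(self) -> None:
        super().__init__()
        self.canonical: Optional[str] = None

    def handle_starttag(self, tag, attrs):
        if tag != "link" or self.canonical is not None:
            return
        attributes = dict(attrs)
        rel_values = (attributes.get("rel") or "").lower().split()
        if "canonical" in rel_values and attributes.get("href"):
            self.canonical = attributes["href"]


def find_canonical(html: str) -> Optional[str]:
    """Return the canonical href declared in the page, or None."""
    finder = CanonicalFinder()
    finder.feed(html)
    return finder.canonical


def url_to_index(page_url: str, html: str) -> str:
    """Return the URL to index: the canonical if one is set, else the page URL."""
    canonical = find_canonical(html)
    if canonical is None:
        return page_url
    # Resolve relative canonicals against the page URL.
    return urljoin(page_url, canonical)


# Hypothetical page whose canonical points at the preferred https://www URL.
PAGE = """<html><head>
<link rel="canonical" href="https://www.example.com/shoes" />
</head><body>...</body></html>"""

print(url_to_index("http://example.com/shoes?ref=home", PAGE))
```

In this sketch, a canonical that points at a missing page or a redirecting URL would send the indexer to the wrong place, which is why removing canonicals altogether is safer than setting them incorrectly.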
If a few PDF and DOC files are not added to your collection or have the wrong title, here are some steps to take.
For documents that have the wrong title: we take the title from the metadata of the document. If no title is present, we use the filename instead. You can do the following to update the title:
You can add data-sj-noindex anywhere in a page and the page will not be indexed. Most commonly this will be defined in the <head> of an HTML page as follows:
Additionally, you can use crawling rules to programmatically exclude sections or certain pages of your website. You can also set individual pages to not be indexed from the data sources tab of the admin Console.
If your staging environment is not publicly accessible, then you will need to allow our crawler access to it.
There are a number of ways to achieve this if you want:
An operation is any process that communicates with the Sajari engine.
Operations are incurred when:
You can always keep track of your operations in the Usage section of your Console.
Domain names are used in URLs to identify websites.
For example, in the URL https://www.sajari.com/faq/getting-started the domain name is 'sajari.com'.
Sajari only counts domains towards your plan where the root domain is different. For example, we treat www.acme.com, acme.com and blog.acme.com as one domain, but acme.com and acme-blogs.com as two different domains.
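As a rough illustration of this counting rule, the Python sketch below groups sites by root domain. It is an assumption-laden approximation, not Sajari's billing logic: root_domain and count_domains are hypothetical helpers, and the "last two labels" heuristic ignores multi-part suffixes such as .co.uk.

```python
from urllib.parse import urlsplit


def root_domain(site: str) -> str:
    """Naive root domain: the last two labels of the host name.

    Assumption: multi-part suffixes such as .co.uk are not handled; a real
    implementation would need the Public Suffix List for that.
    """
    # urlsplit only fills in hostname when a scheme or '//' is present.
    host = urlsplit(site if "//" in site else "//" + site).hostname or ""
    return ".".join(host.split(".")[-2:])


def count_domains(sites) -> int:
    """Count the distinct root domains among the given sites."""
    return len({root_domain(site) for site in sites})


print(root_domain("https://www.sajari.com/faq/getting-started"))
print(count_domains(["www.acme.com", "acme.com", "blog.acme.com"]))
print(count_domains(["acme.com", "acme-blogs.com"]))
```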
Your API key can be found in your console under 'Credentials'.
You can exclude a directory from search results by adding exclude rules in your console. If you want to exclude a specific directory, select either the dir1 or dir2 field, set it to 'Equals', and then enter the name of the directory that you wish to exclude from search results.
For example, if you have a site called www.acme.com with a section containing legacy publications you wish to exclude at www.acme.com/old, you would create an exclude rule where 'dir1' equals 'old'. This will remove all content from the index found within 'old'.
If the content instead lived in www.acme.com/publication/old then you would set the rule where 'dir2' equals 'old'.
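The sketch below shows one way to picture how the dir1 and dir2 fields relate to a URL path. It is illustrative only: path_fields and is_excluded are hypothetical names, and it assumes every path segment, including the last one, is numbered in order, which may differ from how the Console derives these fields.

```python
from urllib.parse import urlsplit


def path_fields(url: str) -> dict:
    """Map the segments of a URL path to dir1, dir2, ... (assumed numbering)."""
    path = urlsplit(url if "//" in url else "//" + url).path
    segments = [segment for segment in path.split("/") if segment]
    return {f"dir{i}": segment for i, segment in enumerate(segments, start=1)}


def is_excluded(url: str, field: str, value: str) -> bool:
    """True when the rule '<field> Equals <value>' matches the URL."""
    return path_fields(url).get(field) == value


# Legacy content directly under /old is matched by dir1 equals 'old'.
print(is_excluded("www.acme.com/old/report-2009", "dir1", "old"))
# Content under /publication/old is matched by dir2 equals 'old' instead.
print(is_excluded("www.acme.com/publication/old/report-2009", "dir2", "old"))
```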
You can set up a variety of rules for crawling your site.
From your Console, you can choose what domains are stored in your collection, and if crawling is active for each domain.
You can also create exclude rules based on URL structure, domain, and a variety of metadata.
Exclude rules will remove all matching records from your collection, and Sajari will not re-crawl any records that match in the future.
Sajari Site Search handles HTTP redirects automatically when a page is reindexed. Redirected pages are removed from the Collection (index) when they return a 301 or 302 HTTP status code.
The destination page of the redirect gets added to the Collection and will be shown in the search results.
Note: Any page with no detected meta changes will be re-crawled after 3-6 days in any case. Redirected pages might still appear in the search results until the next re-crawl takes place.
How to remove redirected pages immediately from a Collection:
You can also manually trigger the crawler to remove the page if you don't want to wait by following these steps:
The page will be removed from the index in a few minutes, and the "State" will change to "Redirect" the next time it is diagnosed. See a screenshot below:
Yes, the Sajari crawler will visit your website periodically to update or remove existing content in your collection. However, if you have the ping-back code installed on your website, any changes will be applied to your collection instantly when that page is visited for the first time and the ping-back code is triggered.
If a page is in a collection, and then its HTTP status code changes to 404, Sajari will immediately remove this page from your collection as soon as the page is visited and the ping-back code is triggered.
Similarly, when a new page is published and viewed for the first time, this will be added to your collection instantly.
If you've created a search interface from the Sajari Console, you can trigger your own analytics on searches by subscribing to events from the search interface in JavaScript.
Interfaces are created using setup, which is included when generating the interface from the Console.
You can subscribe to events by calling your interface with the "sub" value followed by the pipeline (either pipeline or instantPipeline) and event name, then a callback. It takes the form:
For example, if you are using the default inline interface and want to listen to the search-sent event, you would write:
For information on the events, please refer to the Website Search Integration documentation on GitHub.
When you create a Website Search collection and add a domain, e.g. www.website.com, we only authorize searches from the domain that you have added. If a search request is made from a staging site or development site, e.g. www.staging.website.com or www.dev-website.com, you will get an authorization error, i.e. "Authorization for this request failed. Check your credentials".
To test search in staging or development environments, you need to add the domain or URL of the staging or development site in the Domains section of your Collection.
Make sure that the "Search from Domain" is enabled. You can also add and authorize IP addresses in the same way.
Yes, you can use your Sajari account with multiple domains, sites, or applications. You can choose to have multiple domains in one collection or have multiple collections having one domain, depending on your use cases.
There are limits on the number of collections you can create on each plan; see our pricing page for details.
If you're not seeing invited users in the console after the invitation has been sent, here's what to do.
The invites are sent out via email and the link expires after 24 hours. The Team Management section will display users only after they have created an account using the invite link.
If you are wondering whether a user has accepted an invite:
No, Sajari is hosted, i.e. "search as a service" only. You cannot download it, but you can create an account and be up and running in minutes as a service.
We manage a dedicated Kubernetes cluster of machines across multiple availability zones specifically to save you time and resources. Our cluster has many machines and services operating with round-the-clock monitoring.
Installing Sajari for your website search is an easy process. With some basic web development skills you should be able to get up and running in about 5 minutes.
For apps and custom deployments (such as connecting multiple data sources), you can use our SDKs or API directly. This will take a little longer and requires more advanced development knowledge to set up.
Yes. Serverless infrastructure and global load balancing mean minimal risk of disaster. Our applications are replicated across Google's data centers globally, are backed up before each release, and can be restored or rolled back in under 5 minutes. Data is stored in a triple-redundancy block store. The underlying operating system is patched by Google Cloud security and our containers are moved as necessary with "live migration" (zero downtime).
Sajari's data centers are hosted in the USA and Australia. We find little speed degradation communicating transcontinentally but if you require hosting closer to your physical location, don't hesitate to ask.
For Site Search, Sajari provides native support for all the stable versions of the current major browsers: Chrome, Firefox, Safari, Edge, IE11, and IE10.
At Sajari we do not believe ordering results for specific queries is sustainable or desirable.
For any site with more than a few hundred pages, it is not possible for us as humans to predict the content a user is searching for, in exact order, over time. User behaviors change, your content changes, and like everyone else, you're busy. Instead, we provide you with access to tune your results via query rules, in combination with our machine learning model.
While some core features are shared by many search solutions, certain things make Sajari different. When we set out to build Sajari, the intention was to address the problems inherent in current search technologies. In doing so, we think we have made it uniquely capable in some important areas.
Some Sajari-specific benefits include:
Schedule a demo to see how Sajari can help your business or how it compares against your current search provider.
High-quality search drives increased revenue, improved user experience, and general site performance for a variety of reasons.
Search that is fast keeps people on sites for longer. According to Google's own research, slow search has a real impact on site and cart abandonment rates.
Search that is relevant delights your users and makes your content more discoverable. It improves your click-through rates, and finds the information or products your users are after, quickly.
Search that is intelligent gets better over time, as if it's reading your users' minds. Our machine learning model improves the ranking of results based on user behavior.
Sajari's search technology is fast, relevant, and intelligent. Our customers typically see CTR improvements of 50%+ after implementing our search technology.
Overage charges (calculated in multiples of 10,000 queries used above 100,000) will be applied in the following month, and for each successive month that your usage exceeds 100,000 queries.
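The arithmetic behind this can be sketched in Python as follows. The 100,000 and 10,000 figures come from the text above; the rounding rule is an assumption (a partly used block of 10,000 is counted as a whole block), overage_blocks is a hypothetical helper, and no price per block is shown because none is stated here.

```python
import math

INCLUDED_QUERIES = 100_000  # monthly threshold stated above
BLOCK_SIZE = 10_000         # overage is calculated in multiples of this


def overage_blocks(queries_used: int) -> int:
    """Number of 10,000-query blocks counted as overage for one month.

    Assumption: a partly used block is rounded up to a whole block.
    """
    extra = max(0, queries_used - INCLUDED_QUERIES)
    return math.ceil(extra / BLOCK_SIZE)


for used in (90_000, 100_000, 100_001, 125_000):
    print(used, overage_blocks(used))
```

Under these assumptions, a month with 125,000 queries is 25,000 over the threshold and would count as three blocks of 10,000.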
Team members who have 'Admin' or 'Account Owner' roles can delete team members and edit the roles of other team members.
However, the account owner cannot be deleted and the account ownership cannot be transferred to other team members.
You can invite your team members to your Sajari project and give them specific user roles and permissions.
The permissions and visibility for each user role are detailed below:
A sitemap can be helpful in pointing our crawler to webpages that are not internally linked within your website. When you add a new domain, we look for a sitemap at the root of the domain (e.g. www.website.com/sitemap.xml) and index all the links on the sitemap.
Follow these steps to add your sitemap to your collection:
The sitemap will then be indexed. This might take a few minutes or a few hours, depending on the number of pages on your website and the load in our index queues.
If you click on "See extended debug information", you might see a MIME error on the Page Debug tool. This error can be ignored, and your sitemap and all the links on your sitemap will be indexed.
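If you want to check which links a sitemap exposes before adding it, a small Python sketch like the one below can list them. It only assumes the standard sitemap XML format (the sitemaps.org 0.9 namespace); sitemap_urls is a hypothetical helper, the sample URLs are invented, and this is not how Sajari's crawler is implemented.

```python
import xml.etree.ElementTree as ET

# Namespace used by standard sitemap files.
SITEMAP_NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}


def sitemap_urls(sitemap_xml: str) -> list:
    """Return every <loc> URL listed in a sitemap document."""
    root = ET.fromstring(sitemap_xml)
    return [
        loc.text.strip()
        for loc in root.iterfind(".//sm:loc", SITEMAP_NS)
        if loc.text
    ]


# Invented sample: the second page is not linked from anywhere else on the
# site, so the sitemap is the only way a crawler would discover it.
SAMPLE = """<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url><loc>https://www.website.com/</loc></url>
  <url><loc>https://www.website.com/orphan-page</loc></url>
</urlset>"""

for url in sitemap_urls(SAMPLE):
    print(url)
```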
Yes, Sajari supports indexing of DOC, DOCX, and PDF files for all customers on more advanced plans. See our pricing for details.
Stemming is used by search engines to return relevant results for search queries using a shared word stem even if the query the user has typed differs from the available results. For example, a user types in the query 'dental' but your website only uses the word 'dentist'.
Sajari uses a stemming algorithm to ensure that the user is still returned results for their query as the two terms share the same word stem.
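To illustrate the idea of a shared word stem, here is a deliberately simple Python sketch. It is a toy suffix-stripper written for this example only: the suffix list and the toy_stem name are assumptions, and it does not represent the stemming algorithm Sajari actually uses.

```python
# Toy suffix list chosen for this example; longer suffixes are tried first.
SUFFIXES = ("ists", "ist", "als", "al", "ing", "s")


def toy_stem(word: str) -> str:
    """Strip the first matching suffix, keeping at least three characters."""
    word = word.lower()
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


query, page_word = "dental", "dentist"
# Both words reduce to the same stem, so the query can match the page.
print(toy_stem(query), toy_stem(page_word), toy_stem(query) == toy_stem(page_word))
```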
Sajari is built 100% with Go. Go is a great language for speed, concurrency and distributed programming. Check out our GitHub project for more content on Go.
Yes, we support fuzzy search. Our fuzzy matching algorithm is very fast and can handle any character sequence. Sajari will build your own personalized dictionary from the content that appears on your site, which includes jargon and brand-specific terms. More information can be found on our Synonyms page.
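The general idea of matching a misspelled query against a dictionary built from your own content can be sketched in Python as below. This is an illustration under stated assumptions, not Sajari's algorithm: it uses the similarity ratio from Python's standard difflib module as a stand-in for the real fuzzy matcher, and build_dictionary, fuzzy_match, the sample pages and the 0.8 cutoff are all hypothetical.

```python
import difflib
import re


def build_dictionary(pages) -> set:
    """Collect every lowercase word that appears in the given page texts."""
    words = set()
    for text in pages:
        words.update(re.findall(r"[a-z0-9]+", text.lower()))
    return words


def fuzzy_match(query_word: str, dictionary, cutoff: float = 0.8):
    """Return the closest dictionary word above the cutoff, or None."""
    matches = difflib.get_close_matches(
        query_word.lower(), sorted(dictionary), n=1, cutoff=cutoff
    )
    return matches[0] if matches else None


# Invented site content containing jargon and a brand name.
pages = ["Sajari powers site search", "Kubernetes cluster monitoring"]
dictionary = build_dictionary(pages)

print(fuzzy_match("kubernets", dictionary))  # misspelled jargon term
print(fuzzy_match("zzzz", dictionary))       # nothing similar on the site
```

Because the dictionary comes from the site's own text, terms that a general-purpose spell checker would not know can still be matched.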
If you are currently on a free trial and have provided credit card details, you will be charged for your first billing period (i.e. month or year) on the day after the end of the trial.