<?xml version="1.0"?>
<rss version="2.0">
   <channel>
      <title>Anatomy of the Search Engine by CHAITYA SHAH</title>
      <link>https://padlet.com/chaitya_shah/f07iw0ghzxq</link>
      <description>Presentation topic</description>
      <language>en-us</language>
      <pubDate>2018-08-27 10:53:41 UTC</pubDate>
      <lastBuildDate>2026-01-24 21:30:48 UTC</lastBuildDate>
      <webMaster>hello@padlet.com</webMaster>
      <image>
         <url></url>
      </image>
      <item>
         <title>Question: Chaitya Shah, Aditya Pingale</title>
         <author>chaitya_shah</author>
         <link>https://padlet.com/chaitya_shah/f07iw0ghzxq/wish/275492092</link>
         <description><![CDATA[<div>The database consists of indexes to a large amount of content, and web pages change on a daily or even hourly basis. What criteria and factors are considered when updating data that is already stored in the database, and how is the update done?</div>]]></description>
         <enclosure url="" />
         <pubDate>2018-08-27 11:11:55 UTC</pubDate>
         <guid>https://padlet.com/chaitya_shah/f07iw0ghzxq/wish/275492092</guid>
      </item>
      <item>
         <title>Drishti Shah, Nida Shah</title>
         <author></author>
         <link>https://padlet.com/chaitya_shah/f07iw0ghzxq/wish/277134475</link>
         <description><![CDATA[<div>You can request that pages be crawled and indexed as and when you want them to be. The Google index stores the data keyed by term, much like the index at the back of a book, so as soon as you type a keyword, the relevant web pages surface on the SERPs.</div>]]></description>
         <enclosure url="" />
         <pubDate>2018-09-02 09:34:25 UTC</pubDate>
         <guid>https://padlet.com/chaitya_shah/f07iw0ghzxq/wish/277134475</guid>
      </item>
      <item>
         <title>Unnati Mistry</title>
         <author>unnati_mistry</author>
         <link>https://padlet.com/chaitya_shah/f07iw0ghzxq/wish/278570949</link>
         <description><![CDATA[<div>For sites that have already been indexed, search bots/crawlers rely on update signals from them (i.e. links to updated content). This means that if you have created or updated content and it is linked from a page that is already indexed, the link signals bots to index the updated content. Bots can also be told about updates through <strong>Sitemaps</strong>, and a <strong>robots.txt</strong> file can tell them where the sitemap is located.<br><br>Roll no.: 1624017</div>]]></description>
         <enclosure url="" />
         <pubDate>2018-09-06 17:47:30 UTC</pubDate>
         <guid>https://padlet.com/chaitya_shah/f07iw0ghzxq/wish/278570949</guid>
      </item>
      <item>
         <title>Bhakti Kantariya</title>
         <author>bhakti_kantariya</author>
         <link>https://padlet.com/chaitya_shah/f07iw0ghzxq/wish/278767160</link>
         <description><![CDATA[<div>Google indexes website updates based on factors such as the popularity of the site, whether the content is crawlable, and the site structure. If you have made changes to a URL, you can ask Google to re-crawl it, but first make sure that there are no rendering errors. You can also help Google find your updated content using Sitemaps. The content discovered by bots is then sent back to Google's servers, where it is added to the database.</div><div><strong>Roll No: 1624002</strong></div>]]></description>
         <enclosure url="" />
         <pubDate>2018-09-07 09:17:28 UTC</pubDate>
         <guid>https://padlet.com/chaitya_shah/f07iw0ghzxq/wish/278767160</guid>
      </item>
      <item>
         <title>Rachana Gandhi</title>
         <author>rachana_gandhi</author>
         <link>https://padlet.com/chaitya_shah/f07iw0ghzxq/wish/278995859</link>
         <description><![CDATA[<div>In order to update your content in the index, you will want the spider to re-crawl your website. This can be done in two ways:<br>The first is submitting a <strong>sitemap</strong>, and the second is using the <strong>Fetch as Google</strong> option in Webmaster Tools.<br>Google "favors" some websites, though, so you might see websites about dogs and cats getting updated faster because they have more views, popularity, or content frequency than yours. It takes more time to remove data that is already present than to add new data: it often takes more than a month for content that has gone offline to be removed from Google's index.<br>So the major factors for updating can be: site structure, content frequency, popularity, the quality of your website, and the data to be updated.<br><strong>Roll No: 1624001</strong></div>]]></description>
         <enclosure url="" />
         <pubDate>2018-09-07 19:04:02 UTC</pubDate>
         <guid>https://padlet.com/chaitya_shah/f07iw0ghzxq/wish/278995859</guid>
      </item>
      <item>
         <title>Shreya Parikh</title>
         <author>shreya_parikh</author>
         <link>https://padlet.com/chaitya_shah/f07iw0ghzxq/wish/279152817</link>
         <description><![CDATA[<div>The content of an active website will change periodically, so it is important to have control over crawling and indexing.<br>Factors to control crawling and indexing are:<br>1) Avoiding duplicate content<br>2) Consolidating relevancy and authority signals<br>3) Quality of the website<br>4) Popularity of the website<br>The attributes we can use to handle these factors are:<br>1) <a href="https://www.contentkingapp.com/academy/control-crawl-indexing/#pagination-attributes">Pagination attributes</a><br>2) <a href="https://www.contentkingapp.com/academy/control-crawl-indexing/#mobile-attribute">Mobile attribute</a><br>3) <a href="https://www.contentkingapp.com/academy/control-crawl-indexing/#robotstxt">Robots.txt</a><br>4) <a href="https://www.contentkingapp.com/academy/control-crawl-indexing/#hreflang-attribute">Hreflang attribute</a><br><br><strong>Roll No : 1514099</strong></div>]]></description>
         <enclosure url="" />
         <pubDate>2018-09-09 12:17:02 UTC</pubDate>
         <guid>https://padlet.com/chaitya_shah/f07iw0ghzxq/wish/279152817</guid>
      </item>
      <item>
         <title>Nida Shah</title>
         <author>nida_shah</author>
         <link>https://padlet.com/chaitya_shah/f07iw0ghzxq/wish/279154754</link>
         <description><![CDATA[<div>When updating the content of a web page, one thing to keep in mind is that the content should be as original as possible. Also, if the page is popular and the URL is SEO friendly, there is a high probability that bots will be driven to the website. When crawlers visit an updated website, the hyperlinks they find are added to the list of URLs to be visited. This list is known as the frontier. The links on the frontier are visited recursively according to the crawler's algorithm.<br><br><strong>Roll No: 1514112</strong></div>]]></description>
         <enclosure url="" />
         <pubDate>2018-09-09 12:47:20 UTC</pubDate>
         <guid>https://padlet.com/chaitya_shah/f07iw0ghzxq/wish/279154754</guid>
      </item>
      <item>
         <title>Arvind Ganesh</title>
         <author>arvindganesh_a</author>
         <link>https://padlet.com/chaitya_shah/f07iw0ghzxq/wish/279178693</link>
         <description><![CDATA[<div>One can add details in the robots.txt file and make sure that the content is unique or that borrowed content has been given its due credit.<br>We can also tell the crawler to skip the old web page or URL and focus on the new and updated web page.<br><strong>Roll No: 1514126</strong></div>]]></description>
         <enclosure url="" />
         <pubDate>2018-09-09 17:27:18 UTC</pubDate>
         <guid>https://padlet.com/chaitya_shah/f07iw0ghzxq/wish/279178693</guid>
      </item>
      <item>
         <title>Malvika Parulekar</title>
         <author>malvika_p</author>
         <link>https://padlet.com/chaitya_shah/f07iw0ghzxq/wish/279248271</link>
         <description><![CDATA[<div>For any changes that we make on our sites, having sitemaps helps crawlers crawl and index them faster. A sitemap is like a map that lists our content for the bots, so whenever we update our website, it helps bots reach the updated pages sooner. Other options that help are asking your search engine to fetch your site after you have updated it, or enabling webmaster tools, which allow some minimum updates to be fetched faster (paid). Crawling is also largely based on factors such as popularity, site meta, content appropriation, etc. So if you have relevant data, the chances of your website being crawled faster are greater.<br><strong>Roll No : 1624007</strong></div>]]></description>
         <enclosure url="" />
         <pubDate>2018-09-10 03:24:16 UTC</pubDate>
         <guid>https://padlet.com/chaitya_shah/f07iw0ghzxq/wish/279248271</guid>
      </item>
      <item>
         <title>Mehul Monani</title>
         <author>mehul_monani</author>
         <link>https://padlet.com/chaitya_shah/f07iw0ghzxq/wish/279287305</link>
         <description><![CDATA[<div>If you are changing your content frequently, then you can say so in the sitemap.xml file (which robots.txt can point crawlers to). The sitemap hints at how frequently you want your website to be indexed by the crawler.<br>The relevant child tags of the &lt;url&gt; tag are:<br>1. changefreq<br>2. priority<br>3. lastmod<br><br>You can also create multiple XML sitemaps.<br><br>Roll No: 1624006</div>]]></description>
         <enclosure url="" />
         <pubDate>2018-09-10 07:26:19 UTC</pubDate>
         <guid>https://padlet.com/chaitya_shah/f07iw0ghzxq/wish/279287305</guid>
      </item>
      <item>
         <title>Parth Thakker</title>
         <author>parth_kt</author>
         <link>https://padlet.com/chaitya_shah/f07iw0ghzxq/wish/279291380</link>
         <description><![CDATA[<div>In order to increase the performance of the website, it is advisable to update the website at regular intervals.<br>There are two options for getting the updates re-crawled. The first one is using the Fetch as Google option in Webmaster Tools. Here are detailed instructions:</div><ol><li>Go to: <a href="https://www.google.com/webmasters/tools/">https://www.google.com/webmasters/tools/</a> and log in</li><li>If you haven't already, add and verify the site with the "Add a Site" button</li><li>Click on the site name for the one you want to manage</li><li>Click Crawl -&gt; Fetch as Google</li><li>Optional: if you want to do a specific page only, type in the URL</li><li>Click Fetch</li><li>Click Submit to Index</li><li>Select either "URL" or "URL and its direct links"</li><li>Click OK and you're done.</li></ol><div><br>The second option is through robots.txt or sitemap.xml.</div>]]></description>
         <enclosure url="" />
         <pubDate>2018-09-10 07:50:53 UTC</pubDate>
         <guid>https://padlet.com/chaitya_shah/f07iw0ghzxq/wish/279291380</guid>
      </item>
      <item>
         <title>Ankit Ramani</title>
         <author></author>
         <link>https://padlet.com/chaitya_shah/f07iw0ghzxq/wish/279293754</link>
         <description><![CDATA[<div>When the content of a page changes, the sitemap records that change. The robots.txt file can point the bot to the sitemap, which is how the bot learns that content has been updated. Crawlers can then access the sitemap to go through the updated content. Crawling is also largely based on factors such as popularity, site meta, and content appropriation.</div>]]></description>
         <enclosure url="" />
         <pubDate>2018-09-10 08:02:39 UTC</pubDate>
         <guid>https://padlet.com/chaitya_shah/f07iw0ghzxq/wish/279293754</guid>
      </item>
      <item>
         <title>Ashwinikumar,</title>
         <author>viral_vora</author>
         <link>https://padlet.com/chaitya_shah/f07iw0ghzxq/wish/279294946</link>
         <description><![CDATA[<div>The two most important factors regarding updates to the database are frequency and automation.<br><br></div><div>Do you need data to be live and constantly in sync with your other systems, or would daily or even weekly updates to the database be sufficient? Consider that in order to automate the update process, you will typically need a consistent data source, i.e. the field types, and the files supplied each time must be the same. You should consider how often source data is likely to change, if you are ever going to import additional data and if so how your chosen software will deal with this.<br><strong>Roll no:1514122</strong></div>]]></description>
         <enclosure url="" />
         <pubDate>2018-09-10 08:08:08 UTC</pubDate>
         <guid>https://padlet.com/chaitya_shah/f07iw0ghzxq/wish/279294946</guid>
      </item>
      <item>
         <title>Viraj Shah</title>
         <author></author>
         <link>https://padlet.com/chaitya_shah/f07iw0ghzxq/wish/279316237</link>
         <description><![CDATA[<div>The two most important factors regarding updates to the database are frequency and automation.<br><br></div><div>Consider that in order to automate the update process, you will typically need a consistent data source, i.e. the field types and the files supplied each time must be the same. You should consider how often source data is likely to change, whether you are ever going to import additional data, and if so, how your chosen software will deal with this.<br><br>These are the most important factors to consider when updates are made to the large indexed databases that search engines already hold.<br><strong>Roll Number : 1514114</strong></div>]]></description>
         <enclosure url="" />
         <pubDate>2018-09-10 09:34:13 UTC</pubDate>
         <guid>https://padlet.com/chaitya_shah/f07iw0ghzxq/wish/279316237</guid>
      </item>
      <item>
         <title>Saurabh Ughade</title>
         <author></author>
         <link>https://padlet.com/chaitya_shah/f07iw0ghzxq/wish/279317997</link>
         <description><![CDATA[<div>Eliminating unnecessary processing. Eliminating redundant processing. Using more efficient processing in place of less efficient processing.<br><br>Roll No. 1514121</div>]]></description>
         <enclosure url="" />
         <pubDate>2018-09-10 09:42:54 UTC</pubDate>
         <guid>https://padlet.com/chaitya_shah/f07iw0ghzxq/wish/279317997</guid>
      </item>
      <item>
         <title>Aakash Zaveri</title>
         <author>aakash_zaveri</author>
         <link>https://padlet.com/chaitya_shah/f07iw0ghzxq/wish/282179922</link>
         <description><![CDATA[<div><strong>Roll No:-1514125</strong><br><br>The factors to consider when updating the data can be expressed in the robots.txt file, which tells crawlers which pages to crawl and which not to. That way only the pages where changes were made are crawled, and the database is updated accordingly, saving time and memory.</div>]]></description>
         <enclosure url="" />
         <pubDate>2018-09-17 09:04:14 UTC</pubDate>
         <guid>https://padlet.com/chaitya_shah/f07iw0ghzxq/wish/282179922</guid>
      </item>
      <item>
         <title>Shreyash Sharma</title>
         <author>shreyash_sharma</author>
         <link>https://padlet.com/chaitya_shah/f07iw0ghzxq/wish/282180536</link>
         <description><![CDATA[<div>Initially, a web page can be given a “freshness” score based on its inception date, which decays over time. This freshness score may boost a piece of content for certain search queries, but degrades as the content becomes older. Whether the content is fresh or old therefore makes an important difference to the score of the website. Tools such as Google Webmaster Tools are used to tell the search engine that data on the web has been changed or updated. Duplication and redundancy in the data should also be avoided.<br><br><strong>Roll No : 1514115</strong></div>]]></description>
         <enclosure url="" />
         <pubDate>2018-09-17 09:05:48 UTC</pubDate>
         <guid>https://padlet.com/chaitya_shah/f07iw0ghzxq/wish/282180536</guid>
      </item>
      <item>
         <title>Viral Vora</title>
         <author>viral_vora</author>
         <link>https://padlet.com/chaitya_shah/f07iw0ghzxq/wish/282180860</link>
         <description><![CDATA[<div>The content of web pages is updated and changed on an almost regular basis, so one obviously wants the website to be crawled whenever its content changes. One way to achieve this is to add details in the robots.txt file. But one should make sure that the updated content is unique and fresh; redundancy reduces the score of the web page. One can also ask the crawler not to crawl the older content.<br><strong>Roll No: 1514123</strong></div>]]></description>
         <enclosure url="" />
         <pubDate>2018-09-17 09:06:34 UTC</pubDate>
         <guid>https://padlet.com/chaitya_shah/f07iw0ghzxq/wish/282180860</guid>
      </item>
   </channel>
</rss>
