We love feedback from you on our products and the problems in your daily work that you would like us to solve. Please describe the challenge you're encountering and your desired outcome. Be as detailed as possible.
For technical issues or bugs please head to Support or our Developer Community. You can assign up to 20 votes in total. Thank you for your feedback.
Status explanation: 'Future Consideration' = Continuing to collect further feedback, not planned at this time. 'Investigating' = Prioritized for deeper customer and feasibility investigations ahead of planning development.
Perhaps a better approach would be pass a 'modified' time meta attribute.
...
<meta property="article:published_time" content="2019-10-18T15:19:00.000-04:00" />
<meta property="article:modified_time" content="2023-01-18T15:19:00.000-04:00" />
...
The scheduled re-import process would only re-index pages that have been modified since the last time it was scanned.
A note on this. I acknowledge that the initial scraping currently is resource-intensive, since it performs a full NLP analysis of the page content. However, a NLP re-scan would NOT be necessary for this suggested re-import - just a superficial re-import of the factual properties like title, (canonical) URL, metatags, image.
Thank you for your input. We will investigate the idea and see whether it's something we can address in a timely manner.