Dynamic Custom Homepage - ROUND TWO #5445

mheppler · 2019-01-09T17:25:42Z

Misc HTML + CSS + layout improvements

Javascript fixes

Handle cases with no JS through a noscript tag
Remove "Other" in JS
“Search over ###### datasets...” placeholder count needs a comma
Subject counts (1146) need commas
Activity counts (4936934) need commas
Search input watermark dataset count quickly blinks "75,000" before changing to comma-less count
Recent dataset dates (Wed Jan 09 2019) need a comma (e.g. Wed Jan 09, 2019); also remove the day of week, and remove the leading "0" if the day is a single digit
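
A minimal sketch of the count and date formatting requested in the list above, assuming the homepage JS has the raw number and a `Date` object in hand. The helper names `formatCount` and `formatRecentDate` are hypothetical and do not come from the Dataverse code; only the built-in `toLocaleString` and `toLocaleDateString` calls are standard JavaScript.

```javascript
// Hypothetical helpers for illustration; not taken from the Dataverse homepage code.

// Insert thousands separators, e.g. 4936934 -> "4,936,934".
function formatCount(n) {
  return Number(n).toLocaleString('en-US');
}

// Format a Date without the weekday and without a leading zero on the day,
// with a comma before the year, e.g. "Jan 9, 2019".
function formatRecentDate(date) {
  return date.toLocaleDateString('en-US', {
    month: 'short',
    day: 'numeric',
    year: 'numeric',
  });
}

console.log(formatCount(1146));
console.log(formatCount(4936934));
console.log(formatRecentDate(new Date(2019, 0, 9)));
```

Applying the same formatter to the initial placeholder text as well as to the fetched value would be one way to avoid the count switching between a comma and comma-less form.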

Other customization fixes

Header: add 2px above and below Harvard logo (or make the logo slightly smaller)
Header & footer: change both to solid background #ececec

Homepage template fixes

Change page title from "Harvard Dataverse Dataverse" to "Harvard Dataverse"

Additional curation efforts

Journal images – how might we include images of the Journal, if no image of data to show? I (or Dwayne, or Mike) could make images for each journal if the journal does not have a representative image of the journal cover we can use

Related GitHub Issues

Custom home page for Harvard Dataverse #5053 (released in 4.10)
Homepage Count Updates #5447
"Activity" file downloads count (4936934) is off by 649,087 compared to the "Metrics" downloads (5,586,235). This is tracked in "Count of all file downloads from metrics API doesn't always match what's in UI (on old homepage metrics bar)" #4970, and will be affected by additional numbers to be shown. New mockup below. Note that the numbers in the mockup are not necessarily accurate. Numbers that fit the categories shown in the mockup need to be calculated:

Updated Activity section

Misc notes...

Retest this: The navigation is really off. It's hard to get back to the "front" page once you use it to get to production...so sometimes I get the production page, sometimes I get the new homepage
Confirm settings: Create dataverse link behavior: link goes to the right page but 'dataverse' is automatically added to my repository name, which is not required anymore – it can be called anything, not just "dataverse."

scolapasta · 2019-01-09T17:29:10Z

Related to the "Activity download count being off" to-do list item: #4970

matthew-a-dunlap · 2019-01-09T17:37:56Z

For the "Activity" download counts problem, a short-term fix could be to just remove that section of the html until we get the metrics to line up in a future release

scolapasta · 2019-01-09T18:03:41Z

Should Search input watermark dataset count be 27.4k (# of datasets added?) or 81.2k (total number including harvested)? I'd vote for the latter

djbrooke · 2019-01-09T22:20:19Z

There's some more feedback coming from @mercecrosas for this issue. @TaniaSchlatter will add it tomorrow morning.

sbarbosadataverse · 2019-01-09T22:30:55Z

I sent feedback to Tania already

TaniaSchlatter · 2019-01-11T21:02:31Z

@scolapasta Search input watermark dataset count should be @ 81.2k – total number including harvested.

mheppler · 2019-01-15T17:23:23Z

Wanted to record this Stack Overflow resource for new column CSS properties used in the subject count and recent dataset sections.

landreev · 2019-02-07T16:55:58Z

Regarding the harvested datasets:
We do NOT populate the publicationdate of harvested datasets. We only fill the creationdate - and since all the harvested datasets are published by definition, it can be assumed to also be the publicationdate.
The harvested datasets in the database that happen to have the publicationdate are the legacy ones that were migrated from DVN3.

We can discuss changing this arrangement separately. But for the purposes of this issue, we should simply go ahead and change the dataset-counting queries to work based on this definition, that all the harvested datasets should be counted as published.

So instead of doing
"SELECT ... WHERE ... dvobject.publicationdate IS NOT null"
we should be doing
"SELECT ... WHERE ... (dvobject.publicationdate IS NOT null OR dataset.harvestingclient_id IS NOT null)"

matthew-a-dunlap · 2019-02-07T16:56:51Z

@landreev Thanks for investigating this! I'll make the change :)

matthew-a-dunlap · 2019-02-07T23:20:58Z

I've run into more problems than I thought trying to get all the file/dataset queries to work dynamically for harvested/local. I removed the dataLocation option from all files queries (as we don't use them on the homepage anyway) and from dataset/bySubject. The harvest/local/all queryParam for the other dataset queries seems to work well.

After removing this from dataset/bySubject I realized that it was a hard requirement for homepage to get all the results. Talking with @landreev earlier, we agreed that the base query that we had used for datasets/files is a bit confusing and should be rewritten, but I had hoped to avoid doing that as part of the homepage story.

We may be able to sidestep this issue somewhat by writing a different/simpler query that gets the subject counts without caring about the timestamp, and having that return harvest/local. But it'll make the metrics api a bit more confusing and is still work.

I'm out tomorrow and will be unable to work on this. Feel free to revert my last two commits if needed to work on the bySubject query.
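
For readers following along, here is a rough sketch of how homepage JS might build a metrics request with the dataLocation query parameter discussed in this comment. The function name `buildMetricsUrl`, the `/api/info/metrics/` path prefix, and the accepted values in `DATA_LOCATIONS` are all assumptions for illustration; check them against the Metrics API documentation for your Dataverse version before relying on them.

```javascript
// Assumed set of accepted values; verify against the Metrics API docs.
const DATA_LOCATIONS = ['local', 'remote', 'all'];

// Hypothetical helper: builds a metrics URL and rejects unknown dataLocation
// values on the client side before any request is sent.
function buildMetricsUrl(serverUrl, metricPath, dataLocation) {
  if (!DATA_LOCATIONS.includes(dataLocation)) {
    throw new Error(`Unknown dataLocation: ${dataLocation}`);
  }
  // The path prefix below is an assumption, not confirmed by this thread.
  const url = new URL(`/api/info/metrics/${metricPath}`, serverUrl);
  url.searchParams.set('dataLocation', dataLocation);
  return url.toString();
}

console.log(buildMetricsUrl('https://demo.dataverse.org', 'datasets/bySubject', 'all'));
```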

matthew-a-dunlap · 2019-02-08T13:17:52Z

btw, the approach I was trying was to update this section of bySubject/toMonth:

from datasetversion where datasetversion.dataset_id || ':' || datasetversion.versionnumber + (.1 * datasetversion.minorversionnumber) in

removing it to be how the basic toMonth query is now. There may be some problem with this tho as harvested datasets may not have a datasetversion.

landreev · 2019-02-08T15:03:08Z

I can definitely help figuring out better queries there.
Just to confirm that I'm reading this correctly - the "totals" queries are now working correctly (for local, harvested and/or both); and the bySubject query is working correctly for local datasets, but not for harvested ones - ? - I'll look into it.

And yes, it looks like the only harvested datasets that have numeric version numbers are the ones harvested from other Dataverses. The ones harvested from generic OAI archives and such don't. Whether this is a problem necessarily - we need to find out; that fragment in the query:

... ':' || datasetversion.versionnumber + (.1 * datasetversion.minorversionnumber) ...

may simply become a "0" when the version numbers are missing; and it would still uniquely identify the dataset, in combination with the dataset id.

landreev · 2019-02-08T15:05:47Z

(and yes, the bySubjectToMonth should be the same query as bySubject - but with the time argument added...)

matthew-a-dunlap · 2019-02-08T17:04:33Z

@landreev that's correct, the totals look to be working correctly now. Thanks for looking into this.

landreev · 2019-02-08T19:56:26Z

so yeah, these lines:

datasetversion.dataset_id || ':' || max(datasetversion.versionnumber + (.1 * datasetversion.minorversionnumber))

or

datasetversion.dataset_id || ':' || datasetversion.versionnumber + (.1 * datasetversion.minorversionnumber)

both evaluate to NULL (displayed as an empty string) when versionnumber and/or minorversionnumber are null. So count(*) works: it just counts rows, regardless of the content. But "where ... in ..." using this expression only finds the versions with the version numbers present.

(I'm working on a simpler query)
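
The behaviour described above follows from how PostgreSQL treats NULL: adding to or concatenating with NULL yields NULL, and a NULL key never satisfies an "in" comparison. The sketch below models that in plain JavaScript rather than running SQL. The helper names and the sample rows are made up for illustration, and the parenthesization of the version arithmetic is an assumption about how the expression is evaluated.

```javascript
// Model SQL NULL propagation using JavaScript null.
const sqlAdd = (a, b) => (a === null || b === null ? null : a + b);
const sqlMul = (a, b) => (a === null || b === null ? null : a * b);
const sqlConcat = (...parts) =>
  parts.some((p) => p === null) ? null : parts.map(String).join('');

// Mirrors: dataset_id || ':' || (versionnumber + (.1 * minorversionnumber))
function versionKey(row) {
  const version = sqlAdd(row.versionnumber, sqlMul(0.1, row.minorversionnumber));
  return sqlConcat(row.dataset_id, ':', version);
}

// Made-up sample rows: one versioned dataset, one harvested without version numbers.
const rows = [
  { dataset_id: 1, versionnumber: 1, minorversionnumber: 0 },
  { dataset_id: 2, versionnumber: null, minorversionnumber: null },
];

// count(*) counts rows regardless of their content.
const countStar = rows.length;

// "where key in (...)" never matches a NULL key, so that row is dropped.
const keys = rows.map(versionKey);
const matchedByIn = rows.filter((row) => {
  const key = versionKey(row);
  return key !== null && keys.includes(key);
}).length;

console.log(countStar, matchedByIn);
```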

landreev · 2019-02-08T21:48:05Z

OK, I haven't really made it simpler per se; I'm still relying on the "max(datasetversion.versionnumber + (.1 * datasetversion.minorversionnumber))" gimmick in order to select the latest released version, for the local datasets (haven't been able to think of a simpler/cleaner query).
But I got it to work with harvested datasets, and I used a simpler query for those - that relies on the assumption that all the harvested datasets are published, and that there's only one version per dataset.

(I've only modified the datasets/bySubjectToMonth query; if any other similar queries in there need to be able to select either local, or harvested, or both, they need to be similarly modified)

djbrooke added Status: Ready labels Jan 9, 2019

djbrooke added Status: This/Next Sprint and removed Status: Ready labels Jan 9, 2019

djbrooke assigned TaniaSchlatter Jan 9, 2019

djbrooke removed the ready for estimation label Jan 9, 2019

djbrooke unassigned TaniaSchlatter Jan 9, 2019

mheppler added Status: Development and removed Status: This/Next Sprint labels Jan 9, 2019

mheppler self-assigned this Jan 9, 2019

mheppler added a commit that referenced this issue Jan 9, 2019

Removed redundant Dataverse from custom hmpg page title [ref #5445]

915810d

mheppler added a commit that referenced this issue Jan 11, 2019

Minor style and layout tweeks to dynamic custom hmpg template [ref #5445]

cb17831

mheppler added a commit that referenced this issue Jan 11, 2019

Fixed formatting of dates and counts in dynamic custom hmpg [ref #5445]

b00a70f

TaniaSchlatter closed this as completed Jan 11, 2019

TaniaSchlatter reopened this Jan 11, 2019

mheppler added a commit that referenced this issue Jan 11, 2019

Javascript format and count tweeks in dynamic custom hmpg [ref #5445]

4128e7d

mheppler added a commit that referenced this issue Jan 14, 2019

Added additional column to recent datasets on custom hmpg, added placeholder noscript msg [ref #5445]

c508344

matthew-a-dunlap added a commit that referenced this issue Jan 14, 2019

Combine bySubject queries in homepage JS #5445 #5447

e1514dd

mheppler added a commit that referenced this issue Jan 15, 2019

Cleaned up responsive layout of dynamic custom hmpg [ref #5445]

f1c75ad

matthew-a-dunlap added a commit that referenced this issue Jan 15, 2019

Metrics queryParam documentation #5445

9889098

matthew-a-dunlap added a commit that referenced this issue Jan 15, 2019

upgrade script clear metrics cache #5445 #5447

b71c0ce

mheppler added a commit that referenced this issue Jan 15, 2019

Additional layout cleanup of dynamic custom hmpg [ref #5445]

3a89b5d

matthew-a-dunlap added a commit that referenced this issue Jan 15, 2019

Bugfix metrics load & metrics no-guestbook. Wire in metrics #5445 #5447

e008600

matthew-a-dunlap mentioned this issue Feb 6, 2019

Update Metrics Aggregator with New Metrics #5492

Closed

matthew-a-dunlap added a commit that referenced this issue Feb 6, 2019

Metrics error on unknown queryParam. Also tests. #5445

2419918

matthew-a-dunlap assigned matthew-a-dunlap and unassigned landreev Feb 7, 2019

matthew-a-dunlap added a commit that referenced this issue Feb 7, 2019

Fixed harvest query half complete #5445

cdfffcc

matthew-a-dunlap added a commit that referenced this issue Feb 7, 2019

Remove dataLocation from bySubject/toMonth #5445

d5d5230

We will try to add this again later. It is not actually required.

matthew-a-dunlap added a commit that referenced this issue Feb 7, 2019

remove dataLocation from files #5445

6b547f6

landreev self-assigned this Feb 8, 2019

landreev added a commit that referenced this issue Feb 8, 2019

a custom query for datasetsBySubjectToMonth, that works for local, or harvested datasets (or both). (ref #5445)

5ec3552

djbrooke unassigned landreev Feb 11, 2019

matthew-a-dunlap added a commit that referenced this issue Feb 11, 2019

Rewire datasetBySubject metric to take dataLocation #5445

eeb871c

matthew-a-dunlap added Status: QA and removed Status: Development labels Feb 11, 2019

matthew-a-dunlap removed their assignment Feb 11, 2019

kcondon self-assigned this Feb 12, 2019

matthew-a-dunlap added a commit that referenced this issue Feb 12, 2019

Move metrics db changes to correct upgrade script #5445

dc69bc7

matthew-a-dunlap added a commit that referenced this issue Feb 12, 2019

Quieter logging metrics #5445

b1af223

kcondon closed this as completed in 3e00d03 Feb 13, 2019

kcondon removed the Status: QA label Feb 13, 2019

TaniaSchlatter mentioned this issue Mar 5, 2019

Visualization for Harvard Dataverse home page #5603

Closed
